(54) Title: DNA ENCODING ACYLCOENZYME A: CHOLESTEROL ACYLTRANSFERASE AND USES THEREOF
(57) Abstract 

This invention provides an isolated nucleic acid which encodes an acylcoenzyme A: cholesterol acyltransferase II or III. Specifically,
this invention provides an isolated nucleic acid which encodes a human wildtype acylcoenzyme A: cholesterol acyltransferase.
This invention also provides various methods for inhibiting wildtype acylcoenzyme A: cholesterol acyltransferase.

This invention also provides a method for identifying a chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase, and a pharmaceutical composition comprising the chemical compound identified by the above-described method.

This invention also provides a method of treating a subject who has atherosclerosis or hyperlipemia.

DNA ENCODING ACYLCOENZYME A: CHOLESTEROL
ACYLTRANSFERASE AND USES THEREOF

This application is a Continuation-In-Part of U.S. Serial
No. 08/657,620, filed May 30, 1996, the content of which
is incorporated by reference into this application.

Throughout this application, various publications are
referenced by Arabic numerals. Full citations for these
publications may be found listed at the end of the
specification. The disclosures of these publications in
their entireties are hereby incorporated by reference
into this application in order to more fully describe the
state of the art as known to those skilled therein.

Background of the Invention

Cholesterol or related sterols, required for the 
viability of eukaryotic cells, exist in the free form or 
as esters conjugated to fatty acids. The concentration 
of free sterol determines the fluidity of eukaryotic cell 
membranes, whereas esterified sterols cannot participate 
in membrane assembly. The esterification of
intracellular sterol, mediated in mammals by the
membrane-bound enzyme, acylcoenzyme A: cholesterol
acyltransferase, is thus a critical homeostatic
determinant of membrane function (1, 2). For example,
cholesterol depletion of the rough endoplasmic reticulum
(ER) relative to the smooth ER (3) may modulate protein
translocation or membrane-associated transcriptional
activators such as the Sterol Response Element Binding
proteins (SREBP, 4). In addition, production of
cholesterol ester (CE) by acylcoenzyme A: cholesterol
acyltransferase in the rough ER may influence the
transport of sterol between intracellular pools. Similar
esterification activities have been observed in other
eukaryotes such as plants and yeasts (5).

Elevations in acylcoenzyme A: cholesterol acyltransferase
activity perturb several pathways that contribute to
hyperlipidemia and atherosclerosis. Sterol
esterification modifies the activity of the low density
lipoprotein (LDL) receptor and alters serum lipoprotein
composition to be pro-atherogenic (6, 7). It may also be
a rate limiting step in intestinal sterol absorption (8).
Furthermore, CE deposition in the arterial wall is an
important initial step in atherogenesis (9). The
understanding of the acylcoenzyme A: cholesterol
acyltransferase reaction has been hampered by the
difficulty of biochemical purification and by a poor
grasp of the relevant genetic determinants. A human
acylcoenzyme A: cholesterol acyltransferase I gene from
macrophages was identified by complementation of Chinese
Hamster Ovary cell lines deficient in acylcoenzyme A:
cholesterol acyltransferase activity (10) and was
functionally expressed in insect cells devoid of
endogenous activity (11).

Summary of the Invention

This invention provides an isolated nucleic acid which
encodes an acylcoenzyme A: cholesterol acyltransferase II
or an acylcoenzyme A: cholesterol acyltransferase III.

This invention also provides a vector which includes the
isolated nucleic acid which encodes an acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III and a host vector system
which includes a vector.

This invention also provides a method of producing a
polypeptide which comprises growing such host vector
system of claim 14 under suitable conditions permitting
production of the polypeptide and recovering the
polypeptide so produced. This invention also provides a
purified wildtype acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III.

This invention also provides an oligonucleotide of at
least 15 nucleotides capable of specifically hybridizing
with a unique sequence of nucleotides present within a
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III without hybridizing to a
nucleic acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or an acylcoenzyme A:
cholesterol acyltransferase III. This invention also
provides an oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic acid
which encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III without hybridizing to a nucleic acid
which encodes a wildtype acylcoenzyme A: cholesterol
acyltransferase II or an acylcoenzyme A: cholesterol
acyltransferase III.

This invention also provides a method for determining
whether a subject known to have an imbalance in sterol
levels has the imbalance due to a defect in
esterification of sterol and for treating a subject who
has an imbalance in sterol levels due to a defect in
esterification of sterol.

This invention also provides methods for inhibiting
wildtype acylcoenzyme A: cholesterol acyltransferase II
or an acylcoenzyme A: cholesterol acyltransferase III in
a subject.

This invention also provides a method for identifying a
chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase II or an
acylcoenzyme A: cholesterol acyltransferase III in a
subject and a pharmaceutical composition comprising
the chemical compound so identified.

This invention also provides a transgenic, nonhuman
mammal comprising the isolated nucleic acid which encodes
acylcoenzyme A: cholesterol acyltransferase II or an
acylcoenzyme A: cholesterol acyltransferase III.

Brief Description of the Figures

Abbreviations: The amino acid residues are abbreviated
as follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G,
Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P,
Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and
Y, Tyr. CON: consensus sequence.
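
By way of illustration only, the abbreviations listed above may be
represented as a lookup table. The following Python sketch is not
part of the original disclosure; the names AMINO_ACID_CODES and
expand_residues are hypothetical and are chosen here for convenience.

```python
# One-letter to three-letter amino acid codes, as listed in the
# abbreviations paragraph above (20 standard residues).
AMINO_ACID_CODES = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def expand_residues(sequence):
    """Return the three-letter form of a one-letter protein sequence.

    Hypothetical helper: residues are joined with hyphens, and any
    character outside the table above raises ValueError.
    """
    try:
        return "-".join(AMINO_ACID_CODES[r] for r in sequence.upper())
    except KeyError as err:
        raise ValueError(f"unknown residue code: {err.args[0]}") from None
```

Such a table may be useful when reading the alignments of Figures 1A
and 1B, where residues are given in one-letter form.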

Figures 1A and 1B. Protein sequence alignments predicted
from candidate genes for the human acylcoenzyme A:
cholesterol acyltransferase gene I, the yeast homologs,
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and acylcoenzyme A: cholesterol
acyltransferase-related enzyme II, and a consensus
sequence of all three sequences.

Identical residues between all the sequences
are in bold face. Residues of the candidate
leucine zipper heptad motif are italicized.
Potential transmembrane domains were identified
at residues 132 to 155 and 460 to 483; 186 to
202 and 406 to 421; and 215 to 231 and 439 to
451, for human acylcoenzyme A: cholesterol
acyltransferase (Sequence I.D. No.: 2),
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I (Sequence I.D. No.: 4) and
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II (Sequence I.D. No.: 6),
respectively. The firefly luciferase signature
sequences identified in human acylcoenzyme A:
cholesterol acyltransferase I (10) were not
conserved in the yeast genes. CON (Sequence
I.D. No.: 13) denotes the consensus sequence
between the sequences of human acylcoenzyme A:
cholesterol acyltransferase, acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
and acylcoenzyme A: cholesterol
acyltransferase-related enzyme II. R07932
denotes the partial sequence of another human
acylcoenzyme A: cholesterol acyltransferase
candidate cDNA (residues 500 to 600) (Sequence
I.D. No.: 14). The asterisks indicate the
residues in R07932 identical to those of the
other sequences.

1A. Alignment of amino acid residues 1-362 of
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and the identical residues in
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II, human acylcoenzyme A:
cholesterol acyltransferase and CON.

1B. Alignment of amino acid residues 363-611 of
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I and the identical residues in
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II, human acylcoenzyme A:
cholesterol acyltransferase and CON.

Figures 2A, 2B, 2C, 2D and 2E. Construction and analysis
of acylcoenzyme A: cholesterol acyltransferase genes and
deletion mutants.

2A. The are1ΔNA deletion. The schematic depicts a
fragment from yeast chromosome III in plasmid
pH3(34). Strategic restriction endonucleases
are indicated (H, Hind III; B, Bam HI).
2B. The autoradiogram depicts Bam HI digested DNA
from wild-type or disrupted diploid strains
probed with the 2993-bp Bam HI fragment. This
produced a fragment corresponding to the
wild-type acylcoenzyme A: cholesterol
acyltransferase-related enzyme I locus and a
1984-bp fragment characterizing the are1ΔNA
allele. The diploid is heterozygous for the
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I deletion.
2C. Reduced stringency hybridization of yeast
genomic DNA with acylcoenzyme A: cholesterol
acyltransferase-related enzyme I coding
sequences. Genomic DNA from wild-type or
ARE1/are1ΔNA diploids was reprobed with an Nhe
I-Avr II fragment corresponding to the
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I open reading frame ("ORF").
Hybridizations and washes were performed at
60°C in the absence of formamide.
2D. The are2Δ deletion. In step 1, PCR amplifying
oligonucleotides, KO-5' and KO-3', and a LEU2
template were used to produce the selectable
yeast gene flanked at the 5' and 3' ends by
acylcoenzyme A: cholesterol acyltransferase-related
enzyme II. In step 2, this was used to
direct homologous recombination at acylcoenzyme
A: cholesterol acyltransferase-related enzyme
II by transformation of a diploid strain and
selection for leucine prototrophy. In step 3,
integrants to acylcoenzyme A: cholesterol
acyltransferase-related enzyme II were
identified by a PCR reaction using
oligonucleotides flanking ARE2 (are2-5' and
are2-3') and a 3' amplimer within LEU2 (L2-3').
2E. A 999-bp fragment identifies are2Δ, as shown in
the ethidium bromide stained agarose gel. The
wild-type fragment (2206-bp) is also produced
in the same reaction. Leucine prototrophic
transformants with deletions of acylcoenzyme A:
cholesterol acyltransferase-related enzyme II
were obtained at a frequency of -21. M
indicates the 50-2,000-bp ladder markers
(Bio-Rad Laboratories).

Figures 3A and 3B. Fluorescent staining of triglyceride
and sterol ester.

The cells were grown in YEPD to stationary
phase, washed with deionized H2O, and incubated
with 1 µg/ml Nile Red (1 mg/ml in acetone).
Fluorescent images were obtained with a BioRad
MRC600 laser scanning confocal microscope
(BioRad Microscience, Hercules, CA) on an
inverted Zeiss Axiovert microscope (Zeiss,
Oberkochen, Germany) using a 63X (NA 1.4) Zeiss
Plan-apo infinity corrected objective. Samples
were illuminated with the 488 nm line from an
argon ion laser and the fluorescence was
visualized with a 540 nm dichroic mirror and
550 nm long-pass emission filter. Staining of
the cytoplasmic lipid droplets was sensitive to
treatment with isopropanol, proving them to be
lipid in nature.

3A. Wild-type cells.

3B. are1ΔNA are2Δ double mutant cells.

Figures 4A, 4B, 4C and 4D. Neutral lipid and sterol
biosynthesis in ARE deletion mutants.

Strain genotypes are as described in the text;
dpm/mg dry weight: disintegrations per minute
per milligram of dry weight of cells.
4A. Triglyceride biosynthesis. Total lipids were
extracted from cells grown in media containing
³H-oleate and analyzed by thin-layer
chromatography.

4B. Sterol ester biosynthesis. Total lipids were
extracted from cells grown in media containing
³H-oleate and analyzed by thin-layer
chromatography.

4C. Sterol ester biosynthesis in wild-type and
mutant cells transformed with vector control
(black box) or acylcoenzyme A: cholesterol
acyltransferase-related enzyme I
over-expression plasmids, YEp3-16 (increased
copy number, shaded box) and pADH5-36
(transcription from the ADH promoter, open
boxes). Cells were grown in selective media to
maintain the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I expression
plasmids. Lipids were labeled, extracted and
analyzed as above.

4D. Sterol biosynthesis in acylcoenzyme A:
cholesterol acyltransferase-related enzyme
deletion mutants. Lipids were labeled in
synthetic complete media containing [1-¹⁴C]
acetate, saponified and extracted with hexane
and subjected to thin layer chromatography
analysis. The data are representative of three
separate experiments and expressed as the ratio
of incorporation into sterols to incorporation
into fatty acids.
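
As a purely illustrative sketch of the arithmetic behind the
quantities named for Figure 4 (dpm per milligram of dry weight, and
the ratio of incorporation into sterols to incorporation into fatty
acids), the following Python functions may be considered. The
function names and any input values are assumptions made here for
illustration and do not reproduce data from the specification.

```python
def dpm_per_mg(dpm, dry_weight_mg):
    """Normalize radioactive counts (dpm) to milligrams of dry cell weight."""
    if dry_weight_mg <= 0:
        raise ValueError("dry weight must be positive")
    return dpm / dry_weight_mg


def incorporation_ratio(sterol_dpm, fatty_acid_dpm):
    """Ratio of label incorporated into sterols to label in fatty acids.

    Both inputs are assumed to come from the same labeled sample, so
    the ratio is independent of the amount of cells extracted.
    """
    if fatty_acid_dpm <= 0:
        raise ValueError("fatty acid counts must be positive")
    return sterol_dpm / fatty_acid_dpm
```

Because the ratio is taken within a single sample, it may be compared
between strains without first normalizing to dry weight.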

Figures 5A, 5B, 5C, 5D, 5E and 5F. The nucleic acid and
amino acid or predicted amino acid sequences.

5A-1 - 5A-3.

The nucleic acid sequence of human acylcoenzyme
A: cholesterol acyltransferase I is designated
Sequence ID No.: 1. The amino acid sequence of
human acylcoenzyme A: cholesterol
acyltransferase I is designated Sequence ID No.:
2.

5A-1. Nucleic acid sequence of human
acylcoenzyme A: cholesterol
acyltransferase I from nucleic acid
bases 1-1624. Amino acid sequence of
human acylcoenzyme A: cholesterol
acyltransferase I from amino acid
residues 1-76.

5A-2. Nucleic acid sequence of human
acylcoenzyme A: cholesterol
acyltransferase I from nucleic acid
bases 1625-2524. Amino acid sequence
of human acylcoenzyme A: cholesterol
acyltransferase I from amino acid
residues 77-376.

5A-3. Nucleic acid sequence of human
acylcoenzyme A: cholesterol
acyltransferase I from nucleic acid
bases 2525-3649. Amino acid sequence
of human acylcoenzyme A: cholesterol
acyltransferase I from amino acid
residues 377-551.

5B-1 - 5B-3.

The nucleic acid sequence of yeast acylcoenzyme
A: cholesterol acyltransferase-related enzyme
I is designated Sequence ID No.: 3. The amino
acid sequence of yeast acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
is designated Sequence ID No.: 4.

5B-1. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme I from nucleic acid
bases 1-1289. Amino acid sequence of
acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from
amino acid residues 1-209.

5B-2. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme I from nucleic acid
bases 1290-2114. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from
amino acid residues 210-484.

5B-3. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme I from nucleic acid
bases 2115-2601. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I from
amino acid residues 485-611.

5C-1 - 5C-3.

The nucleic acid sequence of yeast acylcoenzyme
A: cholesterol acyltransferase-related enzyme
II is designated Sequence ID No.: 5. The amino
acid sequence of yeast acylcoenzyme A:
cholesterol acyltransferase-related enzyme II
is designated Sequence ID No.: 6.

5C-1. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme II from nucleic acid
bases 1-1061. Amino acid sequence of
acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 1-238.

5C-2. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme II from nucleic acid
bases 1062-1961. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 239-538.

5C-3. Nucleic acid sequence of acylcoenzyme
A: cholesterol acyltransferase-related
enzyme II from nucleic acid
bases 1962-2421. Amino acid sequence
of acylcoenzyme A: cholesterol
acyltransferase-related enzyme II
from amino acid residues 539-643.

5D. The nucleic acid sequence of mouse acylcoenzyme
A: cholesterol acyltransferase II is designated
Sequence ID No.: 11. The amino acid sequence
of mouse acylcoenzyme A: cholesterol
acyltransferase II is designated Sequence ID No.:
12.

Figure 6A. A restriction map of the expression vector
YepAB-ACAT2.

Figures 6B and 6C. Expression of human macrophage ACAT
in pRS426GP.

6B. The ACAT open reading frame was inserted
at the NotI and SacI sites, downstream of
the promoter of the GAL1/10 gene
(GAL1/10p) as described in the text to
produce pRS426-ACAT. URA3 and Ampr denote
selectable markers for yeast and E. coli
respectively. The yeast and bacterial
origins of replication (2µm and ori,
respectively) are indicated.
6C. Immunoblot of human ACAT in protein
extracts from cells transformed with
pRS426-ACAT. Double mutant cells (are1
are2), transformed with pRS426-ACAT
(hACAT) or with pRS426GP (vector), were
induced by growth in galactose. Proteins
were analyzed by immunoblotting.
Equivalent amounts of protein extracts
from mouse adrenal cells were loaded for
comparison. Molecular weight reference
markers (BioRad) are indicated (Mr). The
arrow indicates the position of the DM10
immunoreactive product in extracts from
murine adrenals. The expressed form of
hACAT in yeast is of coincident mobility.
Figures 7A and 7B. Multiple human tissue Northern
analysis of poly(A)+ RNAs probed with ³²P-labeled cDNA
CI.

7A. Tissue specific expression of wildtype human
acylcoenzyme A: cholesterol acyltransferase II
using a wildtype acylcoenzyme A: cholesterol
acyltransferase II specific probe.
7B. Tissue specific expression of wildtype human
acylcoenzyme A: cholesterol acyltransferase I
using a wildtype acylcoenzyme A: cholesterol
acyltransferase I specific probe.

Figures 8A, 8B, 8C and 8D. Tissue specific expression of
ARGP1 and hACAT.

8A and 8B. Multiple tissue Northerns (Clontech) with
indicated samples were probed with an
ARGP1 specific probe as described in the
text.

8C and 8D. The same blots were also analyzed using a
hACAT specific probe. The first panel is
identical to that published by Chang et al.
(8). The second panel is the same blot as
in A and B, probed with the ACAT cDNA 1600
bp probe.

Figure 9. Fetal tissue specific expression of ARGP2.

Multiple tissue Northerns of fetal tissue (Clontech)
with indicated samples were probed with an ARGP2
specific probe as described in the text.

Figure 10. Cultured cell expression of ARGP1.

RNA samples from HepG2 and CV1 were reverse
transcribed and PCR amplified as described in the
text. P indicates a plasmid template control. The
blank lanes represent water or no RT controls.

Figure 11. Sequence comparison of human ACAT and ARGP1.
Figure 12. Sequence comparison of human ACAT and ARGP2.

Figure 13. Phylogenetic comparisons of ACAT-like
molecules.

The sequences shown were identified in genome
databases and aligned based on protein sequence
using GCG Inc. software (pileup). They were
subsequently arranged according to their sequence
conservation to determine approximate evolutionary
relatedness.

Figure 14. Conserved motifs in ACAT-related gene
products.

Figures 15A and 15B. Nucleotide and predicted protein
sequence of ARGP1.

Figure 16. Nucleotide and predicted protein sequence of
ARGP2.

Detailed Description of the Invention

Throughout this application, references to specific 
nucleotides are to nucleotides present on the coding 
strand of the nucleic acid. The following standard
abbreviations are used throughout the specification to
indicate specific nucleotides:

C=cytosine A=adenosine
T=thymidine G=guanosine

A "gene" means a nucleic acid molecule, the sequence of
which includes all the information required for the
normal regulated production of a particular protein,
including the structural coding sequence, promoters and
enhancers.

The nucleic acids or oligonucleotides of the subject
invention also include nucleic acids or oligonucleotides
coding for polypeptide analogs, fragments or derivatives
which differ from naturally-occurring forms in terms of
the identity or location of one or more amino acid
residues (deletion analogs containing less than all of
the residues specified for the protein, substitution
analogs wherein one or more residues specified are
replaced by other residues and addition analogs wherein
one or more amino acid residues is added to a terminal or
medial portion of the polypeptides) and which share some
or all properties of naturally-occurring forms. These
nucleic acids or oligonucleotides include: the
incorporation of codons "preferred" for expression by
selected non-mammalian hosts; the provision of sites for
cleavage by restriction endonuclease enzymes; and the
provision of additional initial, terminal or intermediate
DNA sequences that facilitate construction of readily
expressed vectors.



The nucleic acids and oligonucleotides described and 
claimed herein are useful for the information which they 
provide concerning the amino acid sequence of the 
polypeptide and as products for the large scale synthesis 
of the polypeptide by a variety of recombinant 
techniques. The molecule is useful for generating new 
cloning and expression vectors, transformed and 
transfected prokaryotic and eukaryotic host cells, and 
new and useful methods for cultured growth of such host 
cells capable of expression of the polypeptide and 
related products. 

This invention provides an isolated nucleic acid which
encodes an acylcoenzyme A: cholesterol acyltransferase
II. This isolated nucleic
acid may be DNA or RNA, specifically cDNA or genomic DNA.
Specifically, the isolated nucleic acid has the sequence
designated Seq. I.D. No.: 7. The isolated nucleic acid
encodes a human wildtype acylcoenzyme A: cholesterol
acyltransferase II having substantially the same amino
acid sequence as the sequence designated Seq. I.D. No.:
8. Specifically, the isolated nucleic acid has the
sequence designated Seq. I.D. No.: 11. The isolated
nucleic acid encodes a mouse wildtype acylcoenzyme A:
cholesterol acyltransferase II having substantially the
same amino acid sequence as the sequence designated Seq.
I.D. No.: 12. Further, the isolated nucleic acid
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II.

This invention also provides an isolated nucleic acid
which encodes an acylcoenzyme A: cholesterol
acyltransferase III. This isolated nucleic
acid may be DNA or RNA, specifically cDNA or genomic DNA.
Specifically, the isolated nucleic acid has the sequence
as set forth in Fig. 16. The isolated nucleic acid
encodes a human wildtype acylcoenzyme A: cholesterol
acyltransferase III having substantially the same amino
acid sequence as set forth in Fig. 16. Further, the
isolated nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III.

As used in this application, "acylcoenzyme A: cholesterol
acyltransferase III" means and includes any polypeptide
having acylcoenzyme A: cholesterol acyltransferase III
activity and having an amino acid sequence homologous to
the amino acid sequence of human acylcoenzyme A:
cholesterol acyltransferase II (the sequence of which is
set forth in Fig. 15). Thus, this term includes any such
polypeptide whether naturally occurring and obtained by
purification from natural sources or non-naturally
occurring and obtained synthetically, e.g. by recombinant
DNA procedures. Moreover, the term includes any such
polypeptide whether its sequence is substantially the
same as, or identical to the sequence of any mammalian
homolog of the human polypeptide, e.g. murine, bovine,
porcine, etc. homologs. Additionally, the term includes
mutants or other variants of any of the foregoing which
retain at least some of the enzymatic activity of
nonmutants or nonvariants.



As used in this application, "acylcoenzyme A: cholesterol
acyltransferase II" means and includes any polypeptide
having acylcoenzyme A: cholesterol acyltransferase II
activity and having an amino acid sequence homologous to
the amino acid sequence of human acylcoenzyme A:
cholesterol acyltransferase III (the sequence of which is
set forth in Fig. 16). Thus, this term includes any such
polypeptide whether naturally occurring and obtained by
purification from natural sources or non-naturally
occurring and obtained synthetically, e.g. by recombinant
DNA procedures. Moreover, the term includes any such
polypeptide whether its sequence is substantially the
same as, or identical to the sequence of any mammalian
homolog of the human polypeptide, e.g. murine, bovine,
porcine, etc. homologs. Additionally, the term includes
mutants or other variants of any of the foregoing which
retain at least some of the enzymatic activity of
nonmutants or nonvariants.

The invention also encompasses DNAs and cDNAs which
encode amino acid sequences which differ from those of
acylcoenzyme A: cholesterol acyltransferase II, but which
do not produce phenotypic changes.

The invention also encompasses DNAs and cDNAs which
encode amino acid sequences which differ from those of
acylcoenzyme A: cholesterol acyltransferase III, but
which do not produce phenotypic changes.



The nucleic acids of the subject invention also include
nucleic acids that encode polypeptide analogs,
fragments or derivatives which differ from
naturally-occurring forms in terms of the identity or
location of one or more amino acid residues (including
deletion analogs containing less than all of the residues
specified for the protein, substitution analogs wherein
one or more residues specified are replaced by other
residues and addition analogs wherein one or more amino
acid residues is added to a terminal or medial portion of
the polypeptides) and which share some or all properties
of the naturally-occurring forms.
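
The three classes of analog named above (deletion, substitution and
addition) may be illustrated at the level of the amino acid sequence
by the following Python sketch. It is offered only as an example
under stated assumptions: the function names are hypothetical,
positions are 0-based string indices, and the sketch says nothing
about whether a given analog retains enzymatic activity.

```python
def deletion_analog(seq, start, length=1):
    """Return seq with `length` residues removed, beginning at `start`."""
    if length < 1 or start < 0 or start + length > len(seq):
        raise ValueError("deletion range lies outside the sequence")
    return seq[:start] + seq[start + length:]


def substitution_analog(seq, position, residue):
    """Return seq with the residue at `position` replaced by `residue`."""
    if not 0 <= position < len(seq) or len(residue) != 1:
        raise ValueError("invalid substitution")
    return seq[:position] + residue + seq[position + 1:]


def addition_analog(seq, position, residues):
    """Return seq with `residues` inserted before index `position`.

    position == 0 or position == len(seq) gives a terminal addition;
    any other valid position gives a medial addition.
    """
    if not 0 <= position <= len(seq):
        raise ValueError("insertion point lies outside the sequence")
    return seq[:position] + residues + seq[position:]
```

For example, with the hypothetical peptide MKTAYIAK, deleting two
residues from index 2 gives MKYIAK, and adding HH at the C-terminus
gives MKTAYIAKHH.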



The polypeptide of the subject invention also includes
analogs, fragments or derivatives which differ from
naturally-occurring forms, but have acylcoenzyme A:
cholesterol acyltransferase activity.

This invention also provides a vector comprising an
isolated nucleic acid encoding acylcoenzyme A:
cholesterol acyltransferase II or III. The isolated
nucleic acid of the vectors is operatively linked to a
promoter of RNA transcription which may be, or is
identical to, a bacterial, yeast, insect or mammalian
promoter. The vector may be a plasmid, cosmid, yeast
artificial chromosome (YAC), bacteriophage or eukaryotic
viral DNA. Specifically, this invention provides a
vector designated YepAB-ACAT2 (Figure 6).

Further, numerous other vector backbones known in the art
as useful for expressing proteins may be employed. Such
vectors include but are not limited to: adenovirus,
simian virus 40 (SV40), cytomegalovirus (CMV), mouse
mammary tumor virus (MMTV), Moloney murine leukemia
virus, murine sarcoma virus, and Rous sarcoma virus, DNA
delivery systems, i.e. liposomes, and expression plasmid
delivery systems.

This invention also provides a vector system for the 
production of a polypeptide which comprises the vector in
a suitable host. Suitable hosts include, but are not
limited to, prokaryotic or eukaryotic
cells, e.g. bacterial cells (including gram positive
cells), yeast cells, fungal cells, insect cells and
animal cells.

Suitable animal cells include, but are not limited to,
HeLa cells, Cos cells, CV1 cells and various primary
mammalian cells. Numerous mammalian cells may be used as
hosts, including, but not limited to, the mouse
fibroblast cell NIH 3T3, CHO cells, Ltk- cells, etc.
Expression plasmids such as that described supra may be
used to transfect mammalian cells by methods well known
in the art such as calcium phosphate precipitation and
electroporation.



This invention also provides a method for producing a 
polypeptide (e.g. acylcoenzyme A: cholesterol 
acyltransferase) which comprises growing a host vector 
system under suitable conditions permitting production of 
the polypeptide and recovering the polypeptide so
produced. Methods of recovering polypeptides produced in
such host vector systems are well-known in the art and
typically include steps involving cell lysis,
solubilization and chromatography.

This invention also provides a method of obtaining a
polypeptide in purified form which comprises: (a)
introducing a vector, as described above, into a suitable
host cell; (b) culturing the resulting cell so as to
produce the polypeptide; (c) recovering the polypeptide
produced in step (b); and (d) purifying the polypeptide
so recovered. As discussed above the vector may include
a plasmid, cosmid, yeast artificial chromosome,
bacteriophage or eukaryotic viral DNA. Also, the host
cell may be a bacterial cell (including gram positive
cells), yeast cell, fungal cell, insect cell or animal
cell. Suitable animal cells include, but are not
limited to, HeLa cells, Cos cells, CV1 cells and various
primary mammalian cells. Culturing methods useful for
permitting transformed or transfected host cells to
produce polypeptides are well known in the art as are the
methods for recovering polypeptides from such cells and
for purifying them.

Using the aforementioned method, this invention also
provides a purified wildtype acylcoenzyme A: cholesterol
acyltransferase II or III and a purified mutant
acylcoenzyme A: cholesterol acyltransferase II or III.

This invention also provides an oligonucleotide of at 
least 15 nucleotides capable of specifically hybridizing 
with a unique sequence of nucleotides present within a 
nucleic acid which encodes a wildtype acylcoenzyme A: 
cholesterol acyltransferase II or III without hybridizing 
to a nucleic acid which encodes a mutant acylcoenzyme A: 
cholesterol acyltransferase II or III. Further, this 
invention also provides an oligonucleotide of at least 15 
nucleotides capable of specifically hybridizing with a 
unique sequence of nucleotides present within the nucleic 
acid which encodes a mutant acylcoenzyme A: cholesterol 
acyltransferase II or III without hybridizing to a 
nucleic acid which encodes a wildtype acylcoenzyme A: 
cholesterol acyltransferase II or III. These
oligonucleotides may be DNA or RNA. Such oligonucleotides may be
used in accordance with well known standard methods for 
known purposes, for example, to detect the presence in a 
sample of DNA which will hybridize thereto. 
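
The idea of a 15-nucleotide sequence present in one allele but absent from the other can be sketched in a few lines of Python. This is only an illustration under stated assumptions: exact string matching is a crude stand-in for specific hybridization (which depends on stringency conditions), and the two toy sequences and the function name `unique_kmers` are hypothetical, not taken from the sequence listing.

```python
def unique_kmers(target: str, other: str, k: int = 15) -> list[str]:
    """Return the k-mers of `target` that occur nowhere in `other`."""
    # Collect every k-mer of the sequence we must NOT match.
    other_kmers = {other[i:i + k] for i in range(len(other) - k + 1)}
    seen = set()
    result = []
    for i in range(len(target) - k + 1):
        kmer = target[i:i + k]
        if kmer not in other_kmers and kmer not in seen:
            seen.add(kmer)
            result.append(kmer)
    return result


# Hypothetical toy sequences that differ at a single position.
wildtype = "ATGGCTAGCTAGGATCCGATCGATTACG"
mutant = "ATGGCTAGCTAGGTTCCGATCGATTACG"

# Candidate 15-mers that match the wildtype but not the mutant.
probes = unique_kmers(wildtype, mutant, k=15)
```

Swapping the two arguments gives candidates specific to the mutant sequence instead.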

The oligonucleotides include, but are not limited to, 
oligonucleotides that hybridize to mRNA encoding 
acylcoenzyme A: cholesterol acyltransferase II or III so 
as to prevent translation of the protein. 

This invention also provides a nucleic acid having a 
sequence complementary to the sequence of the isolated 
nucleic acid which encodes acylcoenzyme A: cholesterol 
acyltransferase II or III. 

This invention also provides a method for determining
whether a subject known to have an imbalance in sterol
levels has the imbalance due to a defect in
esterification of sterol which comprises (a) obtaining
from the subject an appropriate sample containing a
mixture of all of the subject's nucleic acids; and (b)
determining whether any nucleic acid in the sample from
step (a) is, or is derived from, a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase so as to thereby determine whether the
subject's imbalance in sterol levels is due to a defect
in esterification of sterol. The determination step (b)
may comprise: (i) contacting the sample of step (a) with
the isolated nucleic acid which encodes acylcoenzyme A:
cholesterol acyltransferase II or III or the
oligonucleotide of at least 15 nucleotides capable of
specifically hybridizing with a unique sequence of
nucleotides present within a nucleic acid which encodes
a wildtype acylcoenzyme A: cholesterol acyltransferase II
or III without hybridizing to a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III under conditions permitting
binding of any nucleic acid in the sample which is, or is
derived from, a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase to the
nucleic acid or oligonucleotide so as to form a complex;
(ii) isolating the complex so formed; and (iii)
identifying the nucleic acid in the isolated complex so
as to thereby determine whether any nucleic acid in the
sample contains a nucleic acid which is, or is derived
from, a nucleic acid which encodes a mutant acylcoenzyme
A: cholesterol acyltransferase II or III. In this
method, both the isolation of any complex formed and the
identification of the nucleic acid in the complex are
effected using standard methods well known in the art.

In order to facilitate identification of the nucleic acid
from step (a), the isolated nucleic acid or the
oligonucleotide is labeled with a detectable marker. The
detectable marker may be a radioactive isotope, a
fluorophore or an enzyme. In addition, the nucleic acid
sample may be bound to a solid matrix before performing
step (i).



This invention also provides a method for treating a
subject who has an imbalance in sterol levels due to a
defect in esterification of sterol which comprises
introducing an isolated nucleic acid which encodes a
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III into the subject under conditions such that the
nucleic acid expresses a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III, so as to thereby
treat the subject.



This invention also provides a method for inhibiting
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III in a subject which comprises transforming
appropriate cells from the subject with a vector which
expresses the nucleic acid complementary to the isolated
nucleic acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III, and introducing
the transformed cells into the subject so as to thereby
inhibit wildtype acylcoenzyme A: cholesterol
acyltransferase II or III. Further, in a preferred
embodiment, the nucleic acid is capable of specifically
hybridizing to a mRNA molecule encoding acylcoenzyme A:
cholesterol acyltransferase II or III so as to prevent
translation of the mRNA molecule.



This invention also provides a method for inhibiting the
wildtype acylcoenzyme A: cholesterol acyltransferase II
or III in a subject which comprises introducing an
oligonucleotide of at least 15 nucleotides capable of
specifically hybridizing with a unique sequence of
nucleotides present within a nucleic acid which encodes
a wildtype acylcoenzyme A: cholesterol acyltransferase II
or III without hybridizing to a nucleic acid which
encodes a mutant acylcoenzyme A: cholesterol
acyltransferase II or III into the subject so as to
thereby inhibit the wildtype acylcoenzyme A: cholesterol
acyltransferase II or III. The oligonucleotide is
capable of specifically hybridizing to a mRNA molecule
encoding acylcoenzyme A: cholesterol acyltransferase II
or III so as to prevent translation of the mRNA molecule.

This invention also provides for a method for identifying
a chemical compound which is capable of inhibiting
acylcoenzyme A: cholesterol acyltransferase II or III in
a subject which comprises (a) contacting a wildtype
acylcoenzyme A: cholesterol acyltransferase II or III
with the chemical compound under conditions permitting
binding between the acylcoenzyme and the chemical
compound; (b) detecting specific binding of the chemical
compound to the acylcoenzyme; and (c) determining whether
the chemical compound inhibits the activity of the
coenzyme so as to identify a chemical compound which is
capable of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

This invention also provides a method for differentially
inhibiting one acylcoenzyme A: cholesterol
acyltransferase but not others using the above methods.
In an embodiment, only acylcoenzyme A: cholesterol
acyltransferase I is inhibited. In another embodiment,
only acylcoenzyme A: cholesterol acyltransferase II
(ARGP1) is inhibited. In another embodiment, only
acylcoenzyme A: cholesterol acyltransferase III (ARGP2)
is inhibited. Alternatively, two of the acylcoenzyme A:
cholesterol acyltransferases may be inhibited. This
invention further provides pharmaceutical compositions
which will differentially inhibit one or more
acylcoenzyme A: cholesterol acyltransferases.

This invention also provides for a pharmaceutical
composition comprising the chemical compound identified
by the above-described method in an amount effective to
inhibit acylcoenzyme A: cholesterol acyltransferase II or
III in a subject and a pharmaceutically effective
carrier.

This invention also provides a method of treating a
subject who has atherosclerosis comprising administering
to the subject the above-described pharmaceutical
composition. This invention also provides a method of
treating a subject who has hyperlipidemia comprising
administering to the subject the above-described
pharmaceutical composition.

This invention also provides a transgenic, nonhuman
mammal comprising the isolated nucleic acid which encodes
acylcoenzyme A: cholesterol acyltransferase II or III.
The mammal includes, but is not limited to, a mouse,
bovine, cat or dog.

This invention is illustrated in the Experimental Details 
section which follows. These sections are set forth to 
aid in an understanding of the invention but are not 
intended to, and should not be construed to, limit in any 
way the invention as set forth in the claims which follow
thereafter.

Experimental Details

First Series of Experiments 

Example 1:

Materials and Methods



Transformation of yeast was performed with lithium
acetate (15) by amino-acid prototrophy selection. A
diploid strain (5051) was constructed between two
isogenic derivatives of W303 (16): W1346-3C (MATa,
ade2-1, can1-100, his3-11,15, leu2-3,112, trp1-1,
ura3-1) and W1134-2C (MATa, can1-100, his3-11,15,
leu2-3,112, trp1-1, ura3-1, met14ΔHpaI-SalI). Growth on
complete (YEPD) or synthetic medium, sporulation and
dissection were performed as described (17).

Competent cells of Escherichia coli strain DH5α
(Gibco-BRL) and DNA modifying enzymes (Promega) were used
according to the manufacturers' instructions. pH3(34),
from L.A. Grivell, was digested with Nhe I, blunt-ended
with Klenow polymerase, and digested with Avr II to
liberate a 1614-bp fragment. An Xba I, Sma I fragment of
pJH-H1 encoding the HIS3 gene was then inserted at these
sites in the vector backbone to produce the are1ΔNA
allele. This construct was digested with Bsa I to
liberate a 3821-bp fragment which was then transformed
into strain 5051. Disruption of ARE1 was confirmed by
Southern blot analysis.



Radioactive probes of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I were prepared by random
priming (Pharmacia) with 32P-dCTP. Genomic DNA (18) was
transferred to Hybond membranes (Amersham) and hybridized
in the absence of formamide at 65°C or 60°C (19).

A shotgun library of cosmid 14-21 from chromosome XIV
(Peter Philippsen, Biozentrum Basel) was constructed
using the nebulizing technique (20). The DNA was
nebulized (90 seconds, 2 bars), size fractionated,
treated with DNA polymerase I (Klenow fragment) and T4
DNA polymerase and blunt-end ligated into pTZ18R
(Pharmacia, Germany). Nucleotide sequencing was
performed by dideoxy-chain-termination with
digoxigenin-labeled reverse primer and Sequenase (United
States Biochemical). The reactions were analyzed on the
GATC 1500 direct blotting electrophoresis system (GATC
GmbH, Germany) using the Boehringer-Mannheim
Dig-development protocol. Sequences were aligned by
SeqMan (DNA Star Inc.). Database searching was performed
with BLAST (21) and GCG Inc. software (22). The DNA
sequences of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I and acylcoenzyme A:
cholesterol acyltransferase-related enzyme II genes are
deposited at GenBank (P25628 and U51790, respectively).

KO-5' and KO-3' primers
(GAGGGGACGAAAATTAGCCGCTATTAATTCTGGTATTGCCACCTAGACAAGAAG
TAAACAGACACAGATGcaagagttcgaatctcttagc (Sequence ID No.:
15) and CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT
CGActacgtcgtaaggccgtttctgac (Sequence ID No.: 16),
respectively; lower case corresponds to the LEU2 gene)
were used in a PCR with the LEU2 gene as a template to
produce the selectable yeast gene flanked by acylcoenzyme
A: cholesterol acyltransferase-related enzyme II gene
sequences (23). This was used to transform a derivative
of yeast strain 5051, heterozygous for the are1ΔNA
allele. To identify integrants at the acylcoenzyme A:
cholesterol acyltransferase-related enzyme II locus, a
PCR was performed on genomic DNA from these strains using
are2-5' (CATTGCAGTTACACGTGAATGC) (Sequence ID No.: 17),
are2-3' (TAGCTCCACAGAACAGTTGCAGG) (Sequence ID No.: 18)
and a 3' amplimer corresponding to the LEU2 gene (L2-3':
CTCTGACAACAACGAAGTCAG) (Sequence ID No.: 19).
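
The two-part layout of the knock-out primers (an upper-case flanking portion followed by a lower-case LEU2-annealing portion) can be made explicit with a short Python sketch. It assumes the case convention stated in the text and that the lower-case portion is one contiguous run at the 3' end; the helper name `split_primer` is hypothetical, and the sequence is the KO-3' primer as transcribed here, with the line break removed.

```python
def split_primer(primer: str) -> tuple[str, str]:
    """Split a primer into its leading upper-case and trailing lower-case parts."""
    for i, base in enumerate(primer):
        if base.islower():
            return primer[:i], primer[i:]
    # No lower-case bases: the whole primer is the flanking part.
    return primer, ""


# KO-3' primer (Sequence ID No.: 16) as transcribed in the text.
ko3 = (
    "CTATAAAGATTTAATAGCTCCACAGAACAGTTGCAGGATGCCTTAGGGT"
    "CGActacgtcgtaaggccgtttctgac"
)

# Upper case: flanking gene sequence; lower case: LEU2-annealing sequence.
flank, leu2_part = split_primer(ko3)
```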



Cells (1-2 absorbance units at 600 nm) were
incubated in YEPD or defined media containing 1 μCi/ml
3H-oleate in tyloxapol/ethanol (1:1) for 16 hours. Total
lipids were prepared by hexane extraction (25) and
analysed by thin layer chromatography on DC-Plastikfolien
Kieselgel 60 plates (E. Merck, Germany). The plate was
developed in hexane, diethyl ether and acetic acid
(70:30:1) and stained with iodine vapor. Incorporation
of label into triglyceride and ergosterol ester was
ascertained following scintillation counting and
normalization to a 14C-cholesterol internal standard and
the dry weight of the cells.
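
A minimal Python sketch of the normalization step follows. The text states only that counts were normalized to an internal standard and to cell dry weight; the exact formula (sample counts divided by standard counts, then by dry weight), the function names and the numbers in the usage comment are assumptions made for illustration.

```python
def normalized_incorporation(sample_dpm: float,
                             standard_dpm: float,
                             dry_weight_mg: float) -> float:
    """Label incorporation per unit internal standard per mg dry weight.

    Assumed formula: (sample counts / recovered standard counts) / dry weight.
    """
    if standard_dpm <= 0 or dry_weight_mg <= 0:
        raise ValueError("standard counts and dry weight must be positive")
    return sample_dpm / standard_dpm / dry_weight_mg


def percent_of_wild_type(mutant_value: float, wild_type_value: float) -> float:
    """Express a normalized mutant value as a percentage of wild-type."""
    return 100.0 * mutant_value / wild_type_value


# Hypothetical usage: 1000 dpm in the sterol ester spot, 500 dpm of
# recovered standard, 2 mg dry weight.
example = normalized_incorporation(1000.0, 500.0, 2.0)
```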

To overexpress the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I gene by copy number
under the control of its own promoter in YEp3-16, a 2354
bp Cla I fragment from pH3(34), encompassing the entire
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I gene, was made blunt-ended with Klenow DNA
polymerase I and introduced into the Sma I site of
YEp352. To constitutively overexpress acylcoenzyme A:
cholesterol acyltransferase-related enzyme I from the ADH
promoter in pADH5-36, a 2290 bp Nar I fragment of
pH3(34), starting 70 bp 5' to the ORF, was blunt-ended
with Klenow and ligated to Klenow-treated, Eco RI
digested pDC-ADH (a derivative of pS5) (26). Increased
expression of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I transcripts, relative to
a wild-type cell, was confirmed by northern blot
analysis.

The incorporation of [1-14C]acetate into saponified
lipids was assessed as a measurement of sterol synthesis.
Approximately 2 OD600 units of cells were incubated with
20 μCi [1-14C]acetate in 2 ml defined media at 30°C for
3 hours and subjected to lipid saponification, hexane
extraction and TLC (29). The
incorporation of counts into total sterols was assessed
following scintillation counting. To normalize the
estimate of sterol biosynthesis to incorporation of
acetate into the fatty acid pool, the aqueous lysate
remaining after hexane extraction was acidified with
concentrated HCl and re-extracted with hexane (30).



Experimental Discussion 



To use yeast genetics to study sterol esterification, the
human acylcoenzyme A: cholesterol acyltransferase
sequence was used to search for homologous yeast genes
and subsequently to identify an additional human isoform
(Figures 1A and 1B). Acylcoenzyme A: cholesterol
acyltransferase-related enzyme I, an 1830-bp open reading
frame (ORF) on yeast chromosome III, encodes a
610-residue protein with 23% identity and 49% similarity to
human acylcoenzyme A: cholesterol acyltransferase I
(Figures 1A and 1B). The yeast and human proteins
possess leucine zipper motifs that could mediate
protein-protein interactions (esterification is probably
performed by a multimeric complex) (12), and possess at
least two predicted transmembrane domains that may
mediate the membrane association of the acylcoenzyme A:
cholesterol acyltransferase reaction (13, 14).

To define the role of acylcoenzyme A: cholesterol
acyltransferase-related enzyme I in sterol
esterification, the deletion mutant, are1ΔNA, was
generated by homologous recombination (15, 16, 17) (Fig.
2A). In a diploid strain, a 1614-bp segment of one
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I allele was replaced with the HIS3 gene and
confirmed by Southern hybridization (Fig. 2B). Analysis
of mutant and wild-type haploid progeny from this diploid
indicated no differences in growth rates or incorporation
of 3H-oleate into ergosterol ester.



The lack of a defect in sterol esterification in are1ΔNA
strains could result from alternate esterification
activities. Reduced stringency hybridization of yeast
genomic DNA with the acylcoenzyme A: cholesterol
acyltransferase-related enzyme I coding sequence as a
probe indicated that additional homologous sequences were
present (18, 19). A Bam HI digestion of genomic DNA
produced the predicted 2.9-kb acylcoenzyme A: cholesterol
acyltransferase-related enzyme I fragment and a ~6.0-kb
hybridizing fragment (Fig. 2C). Contour clamped
homogeneous electric field electrophoretic analysis of
yeast chromosomes suggested the latter sequence was
localized to chromosome X or XIV. On the basis of
homology to acylcoenzyme A: cholesterol
acyltransferase-related enzyme I, this gene, designated acylcoenzyme A:
cholesterol acyltransferase-related enzyme II, encodes a
second yeast homolog to human acylcoenzyme A: cholesterol
acyltransferase I (Figures 1A and 1B). The genomic
sequence (20, 21, 22) encompassing acylcoenzyme A:
cholesterol acyltransferase-related enzyme II on
chromosome XIV predicts a 5997-bp Bam HI fragment and a
1929-bp ORF, which translates into a 643-residue
polypeptide. The yeast acylcoenzyme A: cholesterol
acyltransferase-related enzyme genes are 61% and 49%
identical at the DNA and predicted protein levels,
respectively. Are1p, Are2p and the human acylcoenzyme A:
cholesterol acyltransferase I protein are most related at
the COOH-terminal region (42% identity over a 90-residue
sequence) (Figures 1A and 1B).
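
The ORF lengths and residue counts quoted for the two yeast genes are related by the three-nucleotides-per-codon rule, which can be checked with a short Python sketch. The function name `codons_in_orf` is hypothetical, and the sketch simply divides the stated length by three; whether the quoted ORF lengths include the stop codon is not stated in the text.

```python
def codons_in_orf(orf_bp: int) -> int:
    """Number of codons in an open reading frame of the given length."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    return orf_bp // 3


# ORF lengths quoted in the text for the two yeast genes.
are1_codons = codons_in_orf(1830)  # enzyme I, chromosome III
are2_codons = codons_in_orf(1929)  # enzyme II, chromosome XIV
```

Both values agree with the 610-residue and 643-residue figures given in the text.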

To assess the contribution of Are2p to sterol
esterification, one copy of the acylcoenzyme A:
cholesterol acyltransferase-related enzyme II coding
sequence was deleted from the genome of an ARE1/are1ΔNA
heterozygous diploid by a polymerase chain reaction
approach (23) (Fig. 2D). Haploid progeny representing
the single are1ΔNA and are2Δ deletions and the
are1ΔNA are2Δ double mutant were obtained. To ascertain the
effect of deletion of the acylcoenzyme A: cholesterol
acyltransferase-related enzyme genes upon cytoplasmic
lipid storage, the neutral lipid components (triglyceride
and sterol ester) of the yeast cells were detected by
fluorescence microscopy after staining with Nile Red
(24). In wild-type cells, cytoplasmic fluorescent
droplets accumulated in stationary phase cultures (Fig.
3A). No differences in are single mutants were detected.
However, the number of droplets observed in are1ΔNA are2Δ
double mutants was one-third of that in wild-type
strains (Fig. 3B; over multiple fields, 5.57 ± 2.73 vs.
16.73 ± 4.6 droplets/cell, P<0.05).
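
The "one-third" figure follows directly from the two mean droplet counts, as the short Python check below shows. Only the reported means are used; the variable names are hypothetical, and the sketch does not attempt to reproduce the reported P value, since the number of fields and the statistical test are not given in the text.

```python
# Mean droplets per cell reported in the text (Fig. 3B).
double_mutant_mean = 5.57
wild_type_mean = 16.73

# Ratio of double-mutant to wild-type droplet counts.
droplet_ratio = double_mutant_mean / wild_type_mean

# The ratio lies within one percentage point of one-third.
is_about_one_third = abs(droplet_ratio - 1 / 3) < 0.01
```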

The wild-type and are mutant cells were analyzed for the
incorporation of 3H-oleate into sterol ester (25) (Fig.
4B). No significant differences in triglyceride
biosynthesis were detected. In contrast to normal sterol
ester biosynthesis observed in are1ΔNA mutants,
deficiencies in sterol esterification were apparent in
both are2Δ and are1ΔNA are2Δ mutants. These were detected
by iodine vapor staining of thin layer chromatographs of
total yeast lipids in addition to the oleate
incorporation assays. Sterol ester levels of are2Δ
single mutants were reduced to less than 26% of wild-type,
suggesting that the acylcoenzyme A: cholesterol
acyltransferase-related enzyme II isoform confers the
majority of acyltransferase activity. The are1ΔNA are2Δ
double mutant was almost totally deficient in sterol
esterification (less than 1% of wild-type levels). In
confirmation of the critical role of Are proteins in
sterol esterification, microsomes from double mutant
yeast cells lacked acylcoenzyme A: cholesterol
acyltransferase activity when assayed in vitro.

To confirm that the protein encoded by an acylcoenzyme A:
cholesterol acyltransferase-related enzyme ORF was
sufficient for sterol esterification, the acylcoenzyme A:
cholesterol acyltransferase-related enzyme I coding
sequence was over-expressed in vectors with increased
copy number (YEp3-16) or elevated transcription (the
alcohol dehydrogenase promoter in pADH5-36) (26). There
were no detectable changes in triglyceride or
phospholipid biosynthesis resulting from acylcoenzyme A:
cholesterol acyltransferase-related enzyme I
over-expression. In are2Δ or are1ΔNA are2Δ double
mutants, acylcoenzyme A: cholesterol
acyltransferase-related enzyme I over-expression complemented the sterol
esterification defect (Fig. 4C). In wild-type and
are1ΔNA single mutants, the high level expression of
acylcoenzyme A: cholesterol acyltransferase-related
enzyme I did not elevate sterol ester synthesis above
untransformed controls. This suggests that either
substrates are limiting in acylcoenzyme A: cholesterol
acyltransferase-related enzyme strains or that the
enzyme is post-translationally regulated as in mammalian
cells (27).



An accumulation of unesterified sterol in cell membranes
would likely be deleterious (28). However, despite the
major changes in sterol esterification conferred by the
are mutants, we did not detect any reduction in growth
rates. The established role of sterol esterification in
the storage of sterol suggests that an inability to
esterify sterol could lead to homeostatic changes in
sterol biosynthesis. This relationship might account for
the viability of the mutants. Total lipids, labelled by
the incorporation of [1-14C]acetate into exponentially
growing cells (29, 30), were saponified and extracted.
The are1ΔNA are2Δ double mutants had a two to three-fold
lower level of sterol biosynthesis than wild-type cells,
although no changes were observed in the single mutants
(Fig. 4D). In fact, free sterol concentrations were
roughly equivalent in all cells. Feedback regulation of
sterol biosynthesis by acylcoenzyme A: cholesterol
acyltransferase activity has been observed in mammalian
cells (31) and may be a common mechanism that maintains
intracellular sterol at non-toxic concentrations.

The involvement of multiple gene families in sterol
homeostasis is common in mammalian and yeast cells (for
example, the LDL receptor related protein and scavenger
receptor gene families, the SREBP family, and
3-hydroxy-3-methylglutaryl-CoA reductase) (4, 32, 33, 34). This
apparent redundancy of function has clear physiological
consequences as evidenced by deletion of any one of the
family members. The observation here of two yeast genes
for sterol esterification provoked the hypothesis of
similar redundancy for this reaction in humans. To this
end, a consensus of the yeast acylcoenzyme A: cholesterol
acyltransferase-related enzymes and human acylcoenzyme A:
cholesterol acyltransferase I sequences was used to
identify an additional cDNA with significant identity
(47%) to human acylcoenzyme A: cholesterol
acyltransferase I and the yeast proteins (Figure 1B,
Genbank accession # R07932).

Sterol homeostasis is a complex event under subtle
regulatory controls, one component of which is sterol
esterification. The demonstration here of multiple yeast
and human acylcoenzyme A: cholesterol acyltransferase
isoforms raises the possibility that in vivo, the enzymes
exhibit alternate substrate preferences. The analysis of
esterification reactions in yeast is likely to impact the
understanding of sterol homeostasis and atherosclerosis
in humans.



Example 2:

Tissue specific expression of acylcoenzyme A: cholesterol
acyltransferase II was analyzed by Northern blot
hybridization of RNA obtained from the described tissues.
Using the same materials and procedures as Chang, et al.
(10), the specific expression of acylcoenzyme A:
cholesterol acyltransferase II in liver and muscle is
documented, in contrast to similar experiments using the
previously known acylcoenzyme A: cholesterol
acyltransferase I (10) (Figures 7A and 7B). Acylcoenzyme
A: cholesterol acyltransferase II was also detected and
specifically expressed in adrenal, thyroid and testicular
tissues.

Example 3:



After determining the consensus sequence between the two
yeast genes and the previously known human acylcoenzyme A:
cholesterol acyltransferase, the consensus sequence was
compared to sequences deposited in Genbank. The clones
containing the sequences that showed similarity to the
consensus sequence were ordered from the I.M.A.G.E.
Consortium, affiliated with Research Genetics, Inc., 2130
Memorial Parkway S.W., Huntsville, Alabama 35801. Clones
deposited with the I.M.A.G.E. Consortium are publicly
available upon request. A particular clone, Genbank ID
clone No. Z39933, was chosen. This clone contains a cDNA
fragment whose sequence encodes human acylcoenzyme A:
cholesterol acyltransferase II. The fragment was cut out
with restriction enzymes Bgl II and Not I. The
resulting fragment was introduced into the yeast
expression vector pRS426 at Bgl II and Not I sites
downstream of the yeast promoter (GAL1/GAL10) which is
regulated by carbon sources. The resultant vector was
designated YepAB-ACAT2 (Figure 6).



Example 4:

Antisense RNA technology can be used to create mice, or
mouse or human cell lines, incapable of translating
acylcoenzyme A: cholesterol acyltransferase II RNA into
protein. Standard methods may be used to create an
antisense oligonucleotide to the human homolog of
acylcoenzyme A: cholesterol acyltransferase II. These
methods are well known in the art (36).



Specifically, part or all of a wildtype acylcoenzyme A:
cholesterol acyltransferase II is ligated adjacent to a
mammalian promoter in the opposite orientation. The
promoter and other replicatory mechanisms inside the cell
will transcribe the nonsense (antisense) strand of the
sequence encoding a human homolog of acylcoenzyme A:
cholesterol acyltransferase II.
This strand will bind with the coding mRNA which is
normally synthesized to form a complex. Due to the
formation of this complex, the antisense strand prevents
the translation of the coding mRNA into protein.
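
The relationship between a coding mRNA and the antisense strand that pairs with it is a reverse complement, which the following Python sketch illustrates. The six-base sequence is a hypothetical toy example, not a fragment of the acyltransferase transcript, and the function name `antisense_rna` is likewise an assumption.

```python
# Watson-Crick pairing for RNA: A-U and G-C.
_RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna: str) -> str:
    """Return the antisense RNA (reverse complement) of an mRNA, 5' to 3'."""
    return mrna.upper().translate(_RNA_COMPLEMENT)[::-1]


# Hypothetical toy mRNA fragment, written 5' to 3'.
sense = "AUGGCC"
antisense = antisense_rna(sense)
```

Applying the function twice returns the original sequence, which is a quick sanity check on any antisense design.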



Further, one skilled in the art can synthesize an
oligonucleotide in vitro which is capable of binding the
mRNA that encodes a human homolog of acylcoenzyme A:
cholesterol acyltransferase II so as to inhibit the
translation of the mRNA into protein. The
oligonucleotides can then be introduced into the subject
using a pharmaceutically acceptable carrier. Methods of
synthesizing naturally and non-naturally occurring
oligonucleotides which are capable of inhibiting the
translation of the mRNA into protein are well known in
the art. Also, means of transfecting an organism with
such oligonucleotides are well known in the field.



Example 5: 



Mice can be made with an alteration in their genome,
specifically at the acylcoenzyme A: cholesterol
acyltransferase II gene site. Standard methods may be
used to alter the genome. These methods are well known
in the art (37, 38).



One such process to achieve this goal involves disrupting
the wildtype mouse homolog of acylcoenzyme A: cholesterol
acyltransferase II in vitro, then introducing the altered
gene into mouse embryonal stem cells in such a way as to
target integration into the corresponding genomic region.
This process can be performed such that both copies of
the wildtype acylcoenzyme A: cholesterol acyltransferase
II are replaced by the altered, knock-out version. These
modified cells can be introduced into blastocysts which
will be allowed to develop into chimeric adults. Mice
bearing the altered acylcoenzyme A: cholesterol
acyltransferase II gene will be mated to each other to
generate homozygous mutant acylcoenzyme A: cholesterol
acyltransferase II animals.

Further, one can breed two mice who are heterozygous for 
mutant acylcoenzyme A: cholesterol acyltransferase II. 
From their progeny, one skilled in the art could select 
the progeny who are homozygous for mutant acylcoenzyme A: 
cholesterol acyltransferase II. Breeding and selecting 
such progeny are well known in the art. 
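
The expected outcome of crossing two heterozygous mice can be sketched with a small Punnett-square calculation in Python. This assumes simple Mendelian segregation of a single locus and equal viability of all genotypes, neither of which is asserted in the text; the allele symbols "+" (wildtype) and "-" (mutant) and the function name `cross` are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent1: str, parent2: str) -> dict[str, Fraction]:
    """Expected genotype fractions among progeny of two diploid parents."""
    # Each parent contributes one of its two alleles with equal probability.
    offspring = Counter(
        "".join(sorted(a + b)) for a, b in product(parent1, parent2)
    )
    total = sum(offspring.values())
    return {genotype: Fraction(n, total) for genotype, n in offspring.items()}


# Two heterozygous parents: one wildtype (+) and one mutant (-) allele each.
ratios = cross("+-", "+-")
homozygous_mutant_fraction = ratios["--"]
```

Under these assumptions, one quarter of the progeny are expected to be homozygous for the mutant allele.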

Second Series of Experiments



The efficient regulation of intracellular sterol levels
is required for cell viability in all eukaryotic
organisms. When this regulation is aberrant in cells of
the arterial wall, disease states such as atherosclerosis
ensue. A critical component of this homeostasis is the
intracellular sterol esterification reaction, mediated by
the enzyme acyl coenzyme A-cholesterol acyltransferase
(ACAT). In the model eukaryote, yeast, this laboratory
has demonstrated that sterol esterification is mediated
by a two gene family (Yang et al., Science, 1996,
272:1353). The existence in human cells of two
additional genes encoding ACAT related enzymes is
demonstrated. These proteins are termed ACAT related
gene products (ARGP) 1 and 2, also known as acylcoenzyme
A: cholesterol acyltransferase II and acylcoenzyme A:
cholesterol acyltransferase III, respectively. The ARGPs
exhibit marked sequence conservation to the human ACAT
sequence (hACAT) originally identified by Chang and
colleagues. ARGP1 is expressed at high levels in
intestine and liver, in contrast to the expression of
hACAT, which is of low abundance in these tissues. The
observation that knock-out mutant mice deficient in the
murine homolog of hACAT retain sterol esterification
activity in liver and intestine (Meiner et al., PNAS,
1996, 93:14041) suggests that ARGP1 is a candidate for
sterol esterification in these tissues. The expression
of ARGP2, by contrast, seems to be restricted to the
fetal liver, suggesting that it has a role in lipid
metabolism during development. Analysis of genome
databases indicates that ACAT-like gene families are a
common occurrence in multiple organisms. It is
hypothesized that multiple enzymes for sterol
esterification will provide flexibility in response to
differing sterol and fatty acid substrates encountered by
different tissues. This further suggests specific roles
for these enzymes in lipoprotein production, lipid
homeostasis, and disease progression.

The regulation of membrane sterol levels is required for
cell viability in all eukaryotic organisms. When this
regulation is aberrant in human cells, disease states
such as atherosclerosis (excessive accumulation of
esterified cholesterol in cells of the arterial wall,
reviewed in (1-4)), Niemann-Pick C disease (inability to
store sterol correctly, resulting in lysosomal lipidosis
(5)) or Wolman's disease (a defect in sterol ester
hydrolysis (6)) ensue. A critical component of this
homeostasis is the intracellular neutralization of sterol
by an esterification reaction between the C3-OH group of
cholesterol and fatty acyl-coenzyme A. This reaction is
performed in mammalian cells by the enzyme acyl coenzyme
A:cholesterol acyltransferase (ACAT). Since the process
of sterol esterification converts sterol into a
cytoplasmic storage form, it is critical to all
eukaryotes, including the microorganism Saccharomyces
cerevisiae (budding yeast). Analysis of sterol
homeostasis in this model organism has the advantage that
molecular genetics, particularly since the completion of
the yeast genome sequencing project, is powerful and
relatively straightforward. Taking advantage of this, it
has been demonstrated that sterol esterification in yeast
is mediated by a two-gene family (7), neither member of
which is essential for life. These genes (ARE1 and ARE2,
encoding ACAT Related Enzymes 1 and 2, respectively) are
both capable of independently esterifying sterol,
although in terms of contribution to the sterol ester
mass of the cell, Are1 is a minor isoform relative to
Are2. The genes are structurally and functionally
analogous to the ACAT sequence isolated originally from
macrophages by Chang and colleagues (8). They share
approximately 23% identity at the protein level, and
expression of the human macrophage ACAT cDNA in yeast are
double deletion mutants (lacking both ARE1 and ARE2)
results in esterification of sterol (9).
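
The identity figures quoted in this application (for example, approximately 23% at the protein level) are pairwise percentages computed over an alignment. As a minimal sketch, assuming the two sequences have already been aligned to equal length with "-" marking gaps, and assuming identity is counted over columns in which neither sequence has a gap (other conventions exist, and the source does not state which was used), the calculation could look like the following. The function name is hypothetical, and the two 12-residue peptides are hand translations of short windows of SEQ ID NO: 1 and SEQ ID NO: 3, used purely as toy input.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the gap-free columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap or res_b == gap:
            continue  # skip columns that contain a gap in either sequence
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * identical / compared


# Toy input only: windows around the DWWN motif (illustrative, not a full alignment).
window_seq1 = "FGDRMFYKDWWN"
window_seq3 = "FADRYFYGDWWN"
print(f"{percent_identity(window_seq1, window_seq3):.1f}% identity")
```

Figures for whole proteins depend heavily on how the alignment was built and which columns are counted, so a short window like this one is not comparable with the percentages reported in the text.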

A critical test of the role of the ACAT gene product in
cholesterol homeostasis and atherosclerosis was initiated
by Farese and colleagues, by the production of
"knock-outs" at the Acact locus corresponding to the mouse
homolog of hACAT (10). The fidelity of the mutation was
confirmed by sequencing of cDNA from the disrupted allele
and by the failure to detect immunoreactive protein in
Acact-/- cell extracts. The animals were healthy and
fertile and had residual, but significant, sterol
esterification activity in fibroblasts and macrophages.
Cholesterol ester levels and ACAT activity in the
adrenals were also severely reduced. Conversely, Acact-/-
livers contained significant levels of cholesterol ester,
and esterification activity was not altered.
Furthermore, sterol absorption in the intestine, a
process that probably requires esterification, was
unaffected by the gene disruption. These observations
strongly suggest that, as in yeast, there are multiple
genes for the ACAT reaction in mammalian cells, probably
with tissue specific expression patterns.

Interestingly, despite the clear origin of the yeast gene
family by gene duplication, the ARE proteins have
diverged such that the majority of sequence conservation
is in the COOH-terminal domain of the protein. This is
presumably the critical region of the molecule, since it
is also conserved in the human protein. Using this
region as a database probe, R07932 (11) was identified,
a partially sequenced cDNA entry in the database of
expressed sequence tags (dbEST); R07932 exhibits
significant similarity to the ACATs, particularly over the
COOH-terminal region. Taken together, the "founder"
sequence, the observations in yeast of a two-gene family
for sterol esterification, and the tissue-specific
expression patterns of enzyme activity in Acact-/- knock-out
mice suggest that there are multiple genes for this
reaction in all eukaryotes. Reported here are the
isolation and characterization of cDNAs from two human
loci that encode ACAT Related Gene Products (ARGP).
ARGP1 is represented multiple times in dbEST,
including R07932, and is expressed ubiquitously, with the
highest levels occurring in the liver, intestine and
adrenal gland. By contrast, sequences identical to ARGP2
in the databases are infrequent, consistent with the
observation of an essentially embryonic pattern of
expression. Analysis of genome databases indicates that
gene families that conserve these motifs are a common
occurrence in multiple organisms.

Materials and Methods. 

Database searching for ACAT related sequences. A
sequence corresponding to the strongest region of protein
conservation between the human macrophage ACAT and yeast
ARE sequences was used to identify protein sequences
predicted to be encoded by entries in dbEST using the
tblastn software (NCBI). The DNA sequences thus arising
were used to detect additional clones, in any available
database, that demonstrated overlaps of nucleotide
sequence identity. Databases searched included dbEST,
the non-redundant GenBank, and the confidential database
held at The Institute for Genomic Research (TIGR).
Overlaps between these sequences were detected using the
sequence alignment programs "lineup" and "pileup" from GCG
Inc. (Madison, WI). A consensus sequence was then
generated. Escherichia coli clones with the largest
inserts corresponding to these sequences (see Table 1)
were obtained from the I.M.A.G.E. consortium and
resequenced from both ends using commercial primers, T3
and T7, or internal primers derived from a consensus.
Nucleotide sequencing was performed at the Columbia
University Combined Center core facility using an Applied
Biosystems fluorescent sequencing machine.



Table 1

Entries of human ACAT related gene products in the
database of expressed sequence tags.

Gene:              ARGP1; ARGP2

Clone ID (IMAGE):  200587; 55218; 1881180; 78614; 153836;
                   106260; 128921; 213176; 245265

GenBank ID:        R99213; R99214; c- IMF11; Z43867; 233993;
                   H45923; H45924; M79086; R48474; R48475;
                   T35085; R10272; R10273; N75438; H76642

Insert size (bp):  620; 1800; 1000; 300; 800; 800; 680;
                   540; 300

Comments:          chimera



Isolation and sequencing analysis of full length cDNA
clones of ARGP1 and ARGP2. Since in no instance were any
of the database clones full length for either ARGP1 or
ARGP2, additional clones with intact 5'-ends are
described. Several strategies were chosen. Using a
consensus nucleotide sequence derived from the sequencing
of the dbEST clones, 3'-end, gene specific primers were
designed and synthesized and used in a PCR based, rapid
amplification of cDNA ends (RACE) to derive 5'-RACE
reaction products from a human liver/spleen Marathon
library (Clontech®). A similar strategy was used to derive
PCR products from a human fetal brain library generously
provided by Bento Soares (Columbia University). In some
instances, a nested PCR reaction was performed using
internal gene specific primers and library adaptors.
Finally, primer extension cDNA products were identified
from mRNA extracted from human intestine (a kind gift of
P. Dawson). Amplification products of the predicted size
were confirmed as gene specific using Southern
hybridization to sequences predicted to be at the 3'-end
of these products. The products were isolated from
agarose gels using Geneclean and subcloned into TA
variants of pBluescript (Stratagene®) vectors, or
Klenow/kinase treated and blunt end ligated to pGEM2
(Promega®). Positive clones were identified by colony
hybridization or by PCR amplification using an internal
ARGP specific primer. Clones with the largest inserts
were sequenced to obtain novel sequence and, where
necessary, this process was reiterated with ARGP 5'
specific primers derived from the new sequence.

Tissue specific expression of hACAT and ARGPs. Fragments
of the dbEST clones R99213 and R10273, corresponding to
ARGP1 and ARGP2, respectively, were derived by digestion
with EcoRI and NotI, and purified from agarose gels with
Geneclean. A 1.6 kbp fragment corresponding to the human
ACAT cDNA identified by Chang et al. was used as a probe
for the expression of this gene. Radiolabelled probes
were generated by random priming (Pharmacia®) in the
presence of 32P-dCTP and used to probe Multiple Tissue
Northerns (MTN, Clontech®) of human samples.
Hybridizations were performed, according to the
manufacturer's instructions, using ExpressHyb rapid
hybridization solution for 1 hour at 78°C, followed by
washes in 2xSSC at 55°C and 0.1xSSC, 0.5% SDS at 50°C.

Cell culture expression of ARGPs. To facilitate
quantitation of mRNA from the ARGP genes, a
reverse-transcriptase PCR (RT-PCR) approach was devised to
analyze expression in a variety of human (HepG2, THP-1
macrophages), rodent (J774 macrophages) and simian
(CV1 kidney cells) cell lines. Where possible, primers
were designed to be conserved between rodents and humans
(as described below, the mouse sequence homologous to
ARGP1 has been identified). Alternatively, PCR conditions
were optimized to permit moderate mismatches. The ARGP
amplification primers were designed to be gene specific
(i.e. to regions not conserved within the family) and to
produce distinct size products.



Experimental Results and Discussion 

The approach taken was that the region of strongest
conservation between the yeast ARE proteins and hACAT
would be critical to the function of any sterol
esterification enzyme. A region of conservation (consensus:
LN - - - E FGDR - FY GDWWN, single letter amino-acid code)
that is invariant over the three proteins was chosen and
a series of entries derived from gene sequencing projects
identified. In addition to sequences from
Caenorhabditis elegans, Schizosaccharomyces pombe,
Drosophila melanogaster and Arabidopsis thaliana, several
entries in dbEST of human cDNAs that suggested an
independent gene encoding an ACAT-like protein were
observed. Using the nucleotide sequence of this clone,
a second homologous but distinct entry was identified.
These proteins are termed ACAT Related Gene Products
(ARGP) 1 (acylcoenzyme A:cholesterol acyltransferase II)
and 2 (acylcoenzyme A:cholesterol acyltransferase III).
The sequence identified by Chang et al. (8) will be
referred to as hACAT hereon. A limited protein sequence
to a founder clone (R07932) to ARGP1 has been presented
previously (11). The entries in dbEST that define
these two genes, including their insert sizes, are
described in Table 1. As is evident, the majority of
inserts (with the exception of a chimeric clone, Z43867)
are less than 1 kbp. The northern and sequence analysis
presented indicated them to be incomplete clones.
However, they clearly define two distinct genes of strong
similarity to the ACAT sequence, with the majority of
predicted protein conservation at the COOH-terminal
region. As described below, certain motifs considered
critical to sterol esterification are conserved. To
identify the role of these genes in the reaction, full
length ARGP clones were sought and their expression
patterns identified.
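
The database search described above compares a protein query against nucleotide entries translated in all six reading frames, which is the kind of comparison tblastn performs. The sketch below illustrates only that translation step and then looks for an exact peptide match; real tblastn performs scored local alignment, so this is a simplification and not the method actually used. The function names are hypothetical, the standard genetic code is assumed, and the 60-nucleotide test fragment is the row of SEQ ID NO: 1 printed as positions 2581-2640.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an A/C/G/T string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from the first base; unknown codons become X."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translations(dna: str) -> dict:
    """Map frame (+1, +2, +3, -1, -2, -3) to its translated protein string."""
    frames = {}
    for strand, seq in ((1, dna), (-1, reverse_complement(dna))):
        for offset in range(3):
            frames[strand * (offset + 1)] = translate(seq[offset:])
    return frames


def find_peptide(dna: str, peptide: str) -> list:
    """Return (frame, 0-based protein index) for every exact peptide match."""
    hits = []
    for frame, protein in six_frame_translations(dna).items():
        start = protein.find(peptide)
        while start != -1:
            hits.append((frame, start))
            start = protein.find(peptide, start + 1)
    return hits


fragment = ("TGTTACGCTTTGGTGACAGGATGTTCTATA"
            "AGGATTGGTGGAACTCCACGTCATACTCCA")
print(six_frame_translations(fragment)[3])
print(find_peptide(fragment, "FGDR"))
```

Exact matching is used here only to keep the example short; a tolerant, scored comparison is what allows divergent family members to be found in practice.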






ARGP1, a ubiquitously expressed member of the ACAT gene
family. To establish the profile of expression of ARGP1,
multiple tissue northerns of human mRNA were probed
using a fragment close to the 3' end of the gene.
Although this region displays the maximum conservation at
the protein level in this gene family, the genes are
sufficiently divergent at the DNA level to be able to
design gene specific hybridization probes. The ARGP1
sequence is expressed at abundant levels in many tissues
with the exception of lung and kidney. The majority of
tissues express a 2.0 kb message, but some tissues (e.g.
adrenal, small intestine, thymus) also express a 2.4 kb
mRNA at varying levels. The same northerns were
hybridized with a probe to the human macrophage ACAT
sequence. As described by others (8, 12, 13), the hACAT
sequence detects 4 messages of approximately 3.0, 4.0,
4.7 and 7.4 kb. Upon comparison of the two hybridization
results, an overlapping but occasionally differential
expression pattern was observed. Adrenal tissues express
the highest levels of both hACAT and ARGP1 message. By
this analysis, hACAT messages are rare in liver and
intestine, in contrast to ARGP1, which is highly expressed
in these tissues. Conversely, ARGP1 was poorly expressed
in kidney, lung and placenta although hACAT mRNA was
easily detected. This tissue specific expression
suggests that ARGP1 is an ideal candidate for sterol
esterification in tissues such as liver and intestine,
which retain sterol esterification activity in ACAT k/o
mice (10).



ARGP2, an embryonic isoform of the ACAT gene family.
Efforts to identify a transcript from ARGP2 in adult
tissues were unsuccessful. Therefore embryonic tissue
samples were chosen for investigation, since the original
founder clone was derived from a fetal liver library. A
multiple tissue northern of mRNA from human embryonic
brain, liver, kidney, and lung was probed with an
ARGP2 specific, COOH-terminal probe. As shown in figure
9, a single message of ~2.2 kb was identified only in
embryonic liver tissues, suggesting a high degree of
tissue and developmental specificity to the expression of
this gene product.

Expression of ARGP1 in cell culture models. To develop
a system in which to test the effect of reaction
substrates on the esterification reaction performed by
the ARGP enzymes, the expression of these genes in
several tissue specific cell culture models was
examined. As shown in figure 10, ARGP1 is clearly
expressed in liver (HepG2) and kidney (CV-1) cell lines.
The latter result is somewhat in contrast to the northern
blot on human tissue samples. This most likely reflects
the sensitivity of the RT-PCR approach compared to filter
hybridization and suggests that ARGP1 is probably
expressed in most tissues. Alternatively it may
represent species difference (simian vs. human) or, more
interestingly, the differentiation status of the cells
under study. In data not shown here, ARGP1 was also
clearly expressed in human and mouse macrophage models
(THP-1 and J774 cells).

Sequence characteristics of ARGP1 and ARGP2. By a
combination of 5'-RACE and primer extension, additional
sequence to cDNAs for ARGP1 and ARGP2 (Figs. 11 and 12)
has been identified. The ARGP1 sequence predicts a 407
amino-acid protein with approximately 27% identity and
52% similarity to the hACAT protein (Fig. 4).
Interestingly, as was observed for the yeast ARE
proteins, the strongest conservation exists at the
COOH-terminus of the molecules, to the extent that the
NH2-terminal 50% of all these proteins is essentially
unrelated sequence. This pattern also persists at the
DNA level (not shown). Identification of the genomic
sequence to these cDNAs will establish whether this
remarkable divergence arises by exon shuffling of common
sequences. Alternatively, convergent evolution of
domains with conserved functions in sterol esterification
or related processes may have resulted in the generation
of these families. Since the level of DNA conservation
between ARGP1 and hACAT is quite low (37% identity), the
latter possibility seems likely. The conserved regions
are discussed in the context of multiple ACAT-like
sequences below. The ARGP1 sequence predicts a protein
of approximately 47 kDa with multiple transmembrane
domains in similar positions to those predicted in hACAT.
This strongly suggests a membrane location for ARGP1, as
would be predicted for a sterol esterification enzyme.
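
The relationship between the predicted length (407 amino acids) and the predicted mass (approximately 47 kDa) can be sanity-checked with a back-of-envelope estimate. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from the source; the true value depends on amino-acid composition, so the result is only expected to land in the same range as the quoted figure. The function name is hypothetical.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb assumption, not from the source


def estimate_mass_kda(n_residues: int,
                      avg_residue_da: float = AVERAGE_RESIDUE_MASS_DA) -> float:
    """Rough protein mass in kDa from residue count and an assumed average mass."""
    if n_residues <= 0:
        raise ValueError("residue count must be positive")
    return n_residues * avg_residue_da / 1000.0


print(f"{estimate_mass_kda(407):.1f} kDa")
```

Under these assumptions the estimate comes out a little under 45 kDa, the same order as the figure given in the text; a composition-based calculation on the actual sequence would be needed for a closer value.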

ARGP2 displays a significantly higher level of amino acid
conservation with hACAT than does ARGP1. Over the
sequence shown (Fig. 12), the protein is 59% identical
and 79% similar to human ACAT. Over the same region
ARGP1 is only conserved at the level of 32% identity.
This striking identity is maintained at the DNA level
(62% identity) and may suggest that ARGP2 is more closely
analogous to hACAT in both its mechanism of action and
its origin than is ARGP1. As for ARGP1, certain
hallmark sequences are retained in ARGP2 (see below).
The ARGP2 predicted protein also possesses several
predicted transmembrane domains. One entry to dbEST
for ARGP2 has also been allocated an STS (sequence tagged
site) at the Whitehead Institute (entry U WI-11660) and
has thus been mapped to human chromosome 12.

Sterol esterification enzymes evolve as gene families in
multiple organisms. Using the hACAT and ARGP nucleotide
sequences as probes of multiple databases, we sought to
establish whether the observation of gene families of
ACAT related enzymes in yeast and humans was a common
occurrence in other organisms. In general this is the
case (Fig. 13). Sequences from the genomes of C. elegans,
D. melanogaster and S. pombe have been identified that
are distinct from each other, within an organism, and
exhibit approximately 25% identity at the predicted
protein level. As for all the ACAT-like proteins, the
maximum conservation is observed at the COOH-terminal
region, with many of the apparently critical motifs
described below being maintained. As would be
anticipated, the mouse cDNA for ARGP1 exhibits
approximately 85% identity with its human homolog.

Sequence conservation between ARGPs and ACAT in multiple
organisms. As described above, these sequences are
ubiquitous. This conservation, across and within
organisms, facilitates the identification of critical
domains of esterification enzymes (Fig. 14).
Interestingly, there is no sequence similarity between
any ACAT-like molecule and lecithin cholesterol
acyltransferase (LCAT), despite the shared utilization of
cholesterol. For the hACAT sequence and its murine
homologs, a similarity to "signature" motifs of enzymes
involved in acyl adenylation reactions was reported (8,
12). However, these sequences are unlikely to be
critical, since they are not conserved in any homolog
from any other organism. By contrast, there are regions
of strong conservation between these molecules which may
be critical to function. In the esterification
defective, SRD4 mutant CHO cell line, the expressed but
defective ACAT allele encodes a single amino-acid
substitution of leucine for serine 265, which lies in a
conserved domain of human, rodent and yeast ACAT.
Interestingly, this motif in ARGP1 is more degenerate:
although the serine is conserved, the flanking sequence is
conservatively replaced by similar residues. The ACAT
reaction is probably mediated by a multimeric complex, as
shown by radiation inactivation experiments (15).
Accordingly, the yeast and human sequences all possess
"leucine zipper" multimerization motifs. ARGP1 and ARGP2
lack a classical multimerization motif. Although protein
phosphorylation as a mode of ACAT regulation has been
refuted (16), a very strong region of conservation
(consensus over 7 sequences: LN E FGDR- FYGDWWN, single
letter amino-acid code) predicts a tyrosine kinase
consensus motif for phosphorylation. ARGP2 and ARGP1 are
no exception to this. In particular the aspartic
acid-tryptophan-tryptophan-asparagine (DWWN) sequence
appears to be invariant (with the exception of S. pombe,
where it is AWWN) and may represent an active site for the
esterification reaction. These regions of conservation
are targets for mutagenesis and in preliminary
experiments appear critical to the activity of the ACAT
and ARE enzymes (17).
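
Scanning candidate protein sequences for the DWWN tetrapeptide, while also accepting the AWWN variant noted for S. pombe, amounts to a simple pattern search. A minimal sketch using a regular expression follows. The function name is hypothetical, the first test peptide is a hand translation of part of SEQ ID NO: 1, and the second is an invented string used only to exercise the variant.

```python
import re

# D or A in the first position, followed by the invariant W-W-N.
MOTIF = re.compile(r"[DA]WWN")


def find_motif(protein: str) -> list:
    """Return (0-based start, matched text) for each motif occurrence."""
    return [(m.start(), m.group()) for m in MOTIF.finditer(protein.upper())]


print(find_motif("LRFGDRMFYKDWWNSTSYS"))  # translated window of SEQ ID NO: 1
print(find_motif("MKTAWWNLL"))            # invented peptide with the AWWN variant
```

A hit of this kind only flags a candidate; whether the motif is functionally important is the question the mutagenesis experiments mentioned in the text are meant to address.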

Why ACAT gene families? The role, if any, of these ACAT
sequence homologs in sterol homeostasis is unclear.
Since mouse macrophage ACAT is not critical to sterol
esterification in the liver and intestine, it is possible
that the additional enzymes evolved to recognize
alternate substrates and thus promote sterol absorption
in the intestine or production of lipoproteins by the
liver (39). Future experiments will be directed to
complete the molecular characterization of these genes
and test these hypotheses.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Stephen L. Sturley

(ii) TITLE OF INVENTION: DNA ENCODING ACYLCOENZYME A: CHOLESTEROL
ACYLTRANSFERASE II AND USES THEREOF

(iii) NUMBER OF SEQUENCES: 19

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE:

(B) STREET: 1185 Avenue of the Americas

(C) CITY: New York

(D) STATE: New York

(E) COUNTRY: U.S.A.

(F) ZIP: 10036

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: Not Yet Known

(B) FILING DATE: Herewith

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: John P. White

(B) REGISTRATION NUMBER: 28,678

(C) REFERENCE/DOCKET NUMBER: 0575/50852-A

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (212) 278-0400

(B) TELEFAX: (212) 391-0525

(C) TELEX:



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3649 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GGGTAGAGAC GGGGTTTCAC CGTGTTAGCC AGGATGGTCT GGATCTCCTG ACCTCGTGAT 60
CCACCCACCT CGGCCTCCTA AAGTGCTGGG ATTACAGACA TGAGCCACCG CGCCCAGCCC 120
TATTCATCCC TTTTCAAAAG TCAGACCCTA GGAAGCTGGA GGGAGGTGGG GCATGGTT7T 180
ACAGTGAATT TCTGATTTCA CTCAGGGTGA TAAATCAGAC TCTTGGGGAA GCGGGTGGTG 240
GCTCTGGACA GCAGCAGGAA TGGGGATCCA GTTAGCAACA AATCCATGGA CCTATGACAG 300
GCTGAAAGCC ACCCCTTCTC CATCTTTGGG AGGTTGCCAA TGTCTGATTT AACACTATCC 360
AATGAATGAT CATTGAAAGT AAAAAAf AAC TATCAACTAG CAGAAAATAT AAATGGTAAG 420
CATTAGCACA TATTTCACAT GTTTATATTT GGCTCTCAGA TTGACCTATA AAACAAAGTC 480
TGGGAAATTC TATATGATCC TGAAAAAATG ATACGCTGGT CTGGATGGTA GAATAAGTTG 540
GAGAAATGTT TAAGCCAAAA TGCAGTCTTA CCAATGACTT TTTATTTTAT TTTATTAATT 600
TTCAGGATTT TTGGTATACA GGTGGTTTTT GGTTACATGG AAAAGTTCTT TACTGGTGAT 660
TTCTGAGATT TTAGTTCACC CCTTATCCTG AGCAGTGTAC ACTGTTCCCA ATATGTAGCC 720
TTTTATCCCT CACCCCCTCT AAGTTCAAGA AGACTATGGT CCTGCAGAAA GCTTTATATG 780
TAATTAACAT ATCTTTATCT TTATCTTTAT AGGCAGTAGA CTCATCTTTT GAAACAGATT 840
CCATTAAGAG TGAATGTGTA CCCTCCCTCT AGCCTTTATT ATTACTGTTT TTGCTATTAC 900
ATGTGTTAGT GTATGTGAAT TTAATGCTTA AAAATGTATC CCATTGGCTA CTATGGCAAA 960
AGGTTGACTC ATAAGAGTTT AGCACGGGTT AAGATCTGAA AGTTTTCTCC CAGCCTCTTA 1020
TCACTGGCGC AGAC7TCACA ATTCATGGAA GCCACCAGTG AGATGACATT GCCTCAGGCA 1080
GTTACTATTT TTATATTCTA TAACTCGAGG AGCTCAGGGT TTCGGAAATC ATTAAACTTT 1140
TTTTGTCCTT TTAAAGTTGG AGACAGCAAT TGTAGACAGC CTTCCAGTGG GTTATCTTTT 1200
TGTGTCTCCT TACCTGTGGA GAAGCCTATT AGCTGGGATA TGTAGTTAAA TAGCTATATT 1260
TATATATATC CAGGGCACCC CGAATTCGGG AGAGCTTCCC GGAGTCGACC TTCCTGCTGG 1320
CTGCTCTGTG ACCGCTTCCC GGCTCTGCCC TCTTGGCCGA AGTGCCCGCT GCCGGGCGCG 1380
GGCCTCAGAC AATACAATGG TGGGTGAAGA GAAGATGTCT CTAAGAAACC GGCTGTCAAA 1440
GTCCAGGGAA AATCCTGAGG AAGATGAAGA CCAGAGAAAC CCTGCAAAGG AGTCCCTAGA 1500
GACACCTAGT AATGGTCGAA TTGACATAAA ACAGTTGATA GCAAAGAAGA TAAAGTTGAC 1560
AGCAGAGGCA GAGGAATTGA AGCCATTTTT TATGAAGGAA GTTGGCAGTC ACTTTGATGA 1620
TTTTGTGACC AATCTCATTG AAAAGTCAGC ATCATTAGAT AATGGTGGGT GCGCTCTCAC 1680
AACCTTTTCT GTTCTTGAAG GAGAGAAAAA CAACCATAGA GCGAAGGATT TGAGAGCACC 1740
TCCAGAACAA GGAAAGATTT TTATTGCAAG                       I GCTTGAAGT 1800
GGACCACATC AGAACAATAT ATCACATGTT TATTGCCCTC CTCATTCTCT TTATCCTCAG 1860
CACACTTGTA GTAGATTACA TTGATGAAGG AAGGCTGGTG CTTGAGTTCA GCCTCCTGTC 1920
TTATGCTTTT GGCAAATTTC CTACCGTTGT TTGGACCTGG TGGATCATGT TCCTGTCTAC 1980
ATTTTCAGTT CCCTATTTTC TGTTTCAACA TTGGCGCACT GGCTATAGCA AGAGTTCTCA 2040
TCCGCTGATC CGTTCTCTCT TCCATGGCTT TCTTTTCATG ATCTTCCAGA TTGGAGTTCT 2100
AGGTTTTGGA CCAACATATG TTGTGTTAGC ATATACACTG CCACCAGCTT CCCGGTTCAT 2160
CATTATATTC GAGCAGATTC GTTTTGTAAT GAAGGCCCAC TCATTTGTCA GAGAGAACGT 2220
GCCTCGGGTA CTAAATTCAG CTAAGGAGAA ATCAAGCACT GTTCCAATAC CTACAGTCAA 2280
CCAGTATTTG TACTTCTTAT TTGCTCCTAC CCTTATCTAC CGTGACAGCT ATCCCAGGAA 2340
TCCCACTGTA AGATGGGGTT ATGTCGCTAT GAAGTTTGCA CAGGTCTTTG GTTGCTTTTT 2400
CTATGTGTAC TACATCTTTG AAAGGCTTTG TGCCCCCTTG TTTCGGAATA TCAAACAGGA 2460
GCCCTTCAGC GCTCGTGTTC TGGTCCTATG TGGTATTTAA CTCCATCTTG CCAGGTGTGC 2520
TGATTCTCTT CCTTACTTTT TTTGCCTTTT TGCACTGCTG GCTCAATGCC TTTGCTGAGA 2580
TGTTACGCTT TGGTGACAGG ATGTTCTATA AGGATTGGTG GAACTCCACG TCATACTCCA 2640
ACTATTATAG AACCTGGAAT GTGGTGGTCC ATGACTGGCT ATATTACTAT GCTTACAAGG 2700
ACTTTCTCTG GTTTTTCTCC AAGAGATTCA AATCTGCTGC CATGTTAGCT GTCTTTGCTG 2760
TATCTGCTGT AGTACACGAA TATGCCTTGG CTGTTTGCTT GAGCTTTTTC TATCCCGTGC 2820
TGTTCGTGCT CTTCATGTTC TTTGGAATGG CTTTCAACTT CATTGTCAAT GATAGTCGGA 2880
AAAAGCCGAT TTGGAATGTT CTGATGTGGA CTTCTCTTTT CTTGGGCAAT GGAGTCTTAC 2940
TCTGCTTTTA TTCTCAAGAA TGGTATGCAC GTCGGCACTG TCCTCTGAAA AATCCCACAT 3000
TTTTGGATTA TGTCCGGCCA CGTTCCTGGA CTTGTCGTTA CGTGTTTTAG AAGCTTGGAC 3060
TTTGTTTCCT CCTTGTCACT GAAGATTGGG TAGCTCCCTG ATTTGGAGCC AGCTGTTTCC 3120
AGTTGTTACT GAAGTTATCT GTGTTATTTG GACCACTCCA GGCTTTACAG ATGACTCACT 3180
CCATTCCTAG GTCACTTGAA GCCAAACTGT TGGAAGTTCA CTGGAGTCTT GTACACTTAA 3240
GCAGAGCAGA ACTTTTTTTG TGGGGCTGGG TGGGGGGAGA AGACCGACTA ACAGCTGAAG 3300
TAATGACAGA TTGTTGCTGG GTCATATCAG CTTTATCCCT TGGTAATTAT ATCTGTTTTG 3360
TTTCTTGACT CTGTCCAATC AGAGAATAAA CATCATAGTT TCTTGGCCAC TGAATTAGCC 3420
AAAACACTTA GGAAGAAATC ACTTAAATAC CTCTGGCTTA GAAATTTTTT CATGCACACT 3480
GTTGGAATGT ATGCTAATTG AACATGCAAT TGGGGAAGAA AAAATGTAGA ATGATTTTTG 3540
CTATTTCTAG TAGAAAGAAA ATGTCTGTTT TCCAAAGATA ATGTTATACA TCCTATTTTG 3600
TAATTTTTTT GAAAAAAGTT CAATGTTCAG TTTTCCTTAGT TTTTACCTT 3660
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 550 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Val Gly Glu Glu Lys Met Ser Leu Arg Asn Arg Leu Ser Lys Ser
1 5 10 15

Arg Glu Asn Pro Glu Glu Asp Glu Asp Gln Arg Asn Pro Ala Lys Glu
20 25 30

Ser Leu Glu Thr Pro Ser Asn Gly Arg Ile Asp Ile Lys Gln Leu Ile
35 40 45

Ala Lys Lys Ile Lys Leu Thr Ala Glu Ala Glu Glu Leu Lys Pro Phe
50 55 60

Phe Met Lys Glu Val Gly Ser His Phe Asp Asp Phe Val Thr Asn Leu
65 70 75 80

Ile Glu Lys Ser Ala Ser Leu Asp Asn Gly Gly Cys Ala Leu Thr Thr
85 90 95

Phe Ser Val Leu Glu Gly Glu Lys Asn Asn His Arg Ala Lys Asp Leu
100 105 110

Arg Ala Pro Pro Glu Gln Gly Lys Ile Phe Ile Ala Arg Arg Ser Leu
115 120 125

Leu Asp Glu Leu Leu Glu Val Asp His Ile Arg Thr Ile Tyr His Met
130 135 140

Phe Ile Ala Leu Leu Ile Leu Phe Ile Leu Ser Thr Leu Val Val Asp
145 150 155 160

Tyr Ile Asp Glu Gly Arg Leu Val Leu Glu Phe Ser Leu Leu Ser Tyr
165 170 175

Ala Phe Gly Lys Phe Pro Thr Val Val Trp Thr Trp Trp Ile Met Phe
180 185 190

Leu Ser Thr Phe Ser Val Pro Tyr Phe Leu Phe Gln His Trp Arg Thr
195 200 205

Gly Tyr Ser Lys Ser Ser His Pro Leu Ile Arg Ser Leu Phe His Gly
210 215 220

Phe Leu Phe Met Ile Phe Gln Ile Gly Val Leu Gly Phe Gly Pro Thr
225 230 235 240

Tyr Val Val Leu Ala Tyr Thr Leu Pro Pro Ala Ser Arg Phe Ile Ile
245 250 255

Ile Phe Glu Gln Ile Arg Phe Val Met Lys Ala His Ser Phe Val Arg
260 265 270

Glu Asn Val Pro Arg Val Leu Asn Ser Ala Lys Glu Lys Ser Ser Thr
275 280 285

Val Pro Ile Pro Thr Val Asn Gln Tyr Leu Tyr Phe Leu Phe Ala Pro
290 295 300

Thr Leu Ile Tyr Arg Asp Ser Tyr Pro Arg Asn Pro Thr Val Arg Trp
305 310 315 320

Gly Tyr Val Ala Met Lys Phe Ala Gln Val Phe Gly Cys Phe Phe Tyr
325 330 335

Val Tyr Tyr Ile Phe Glu Arg Leu Cys Ala Pro Leu Phe Arg Asn Ile
340 345 350

Lys Gln Glu Pro Phe Ser Ala Arg Val Leu Val Leu Cys Val Phe Asn
355 360 365

Ser Ile Leu Pro Gly Val Leu Ile Leu Phe Leu Thr Phe Phe Ala Phe
370 375 380

Leu His Cys Trp Leu Asn Ala Phe Ala Glu Met Leu Arg Phe Gly Asp
385 390 395 400

Arg Met Phe Tyr Lys Asp Trp Trp Asn Ser Thr Ser Tyr Ser Asn Tyr
405 410 415

Tyr Arg Thr Trp Asn Val Val Val His Asp Trp Leu Tyr Tyr Tyr Ala
420 425 430

Tyr Lys Asp Phe Leu Trp Phe Phe Ser Lys Arg Phe Lys Ser Ala Ala
435 440 445

Met Leu Ala Val Phe Ala Val Ser Ala Val Val His Glu Tyr Ala Leu
450 455 460

Ala Val Cys Leu Ser Phe Phe Tyr Pro Val Leu Phe Val Leu Phe Met
465 470 475 480

Phe Phe Gly Met Ala Phe Asn Phe Ile Val Asn Asp Ser Arg Lys Lys
485 490 495

Pro Ile Trp Asn Val Leu Met Trp Thr Ser Leu Phe Leu Gly Asn Gly
500 505 510

Val Leu Leu Cys Phe Tyr Ser Gln Glu Trp Tyr Ala Arg Arg His Cys
515 520 525

Pro Leu Lys Asn Pro Thr Phe Leu Asp Tyr Val Arg Pro Arg Ser Trp
530 535 540

Thr Cys Arg Tyr Val Phe
545 550

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GCCCTCCAGC TCTCTACTAA GACCGGTCGC AAG C AT G CT r; 60
CCACACATGG TCTCCCTCCT GCGTCAAAAT
ATCCAGAATG AAACTGCACG GCTGCAGATT
CGCCTGGATA TGCTGCACTC TGGTCGAACC 240
GAAAATCTCC GGTGGTGCCA ACTCCTCAGG
CCTGTGTTTC CGTACCGCAC CTTTTTAGCA
CTTCTTCTGC TTTTTTCCTC TTTATCACAC
TCGATAAAAT TGAAAAATGT GAGATGGTGT
GCCATCAAAT ACCCGGCCGT GGTTGGACTC
TGGCAACATG CGGATGATAT AAGAAGGACG
ACACTCACGC AGGTGGTTGT TCAGCACGGC
GAATGACGGA GACTAAGGA7 TTGTTGCAAG
ATTCCGCAGA AGCCAACAAA CGGCATTCGG
AGTCCATGGA GGTTTCGCCA CGGTCGTCTA
CTGAAGGAGT GGAGTCGACT GAGGCGGAAC
AGTACCCT3T GGACGCCCAC ATGCAAAAGT
CGAGGTTCCA CCGAAAGGAT GCTAGCAAGT
ATCCTCGCCC CACGCTCCTG GACAGCGCCA
GTCCGGTGCT GGAGAAACAG CTCAAAAATT
CGGTGAAGAC TACGGTGAAG ACTACGGAGA
AAAAACTGGA GTCGAACTTT TCAGGGATCT
TAGCCATCAG GTGCTGCACA GATTACTATG
AAATCGTGCA GTACATGACA ACGGACTTGT
TCCTGTGCAC TTTCTTCGTG GTTTTCGTGC
GGAAGTGGAC TGGGTTCGTT GCAGTGAGCA
TCCCCATTTA CGTCTACTAC TTTGATTTCA
ACTCCGTGGT GTTTGTTATG AAGAGCCACT
ACATAAAGCA GGAACTCGAG TACTCTTCCA
CCCCAGAGAC CCGCGAGATT CTGCAAAAAA
ACCAGACCAA GGATAACGAC TTCCCCAACA
GTTTGTTCCC CGTCCTCGTG TACCAGATCA
GGTATGTGTT GGAGAAGGTG TGCGCCATCA TTGGCACCAT CTTCCTCATG ATGGTCACGG 1920
CACAGTTCTT CATGCACCCG GTGGCCATGC GCTGTATCCA GTTCCACAAC ACGCCCACCT 1980
TCGGCGGCTG GATCCCCGCC ACGCAAGAGT GGTTCCACCT GCTCTTCGAC ATGATTCCGG 2040
GCTTCACTGT TCTGTACATG CTCACGTTTT ACATGATATG GGACGCTTTA TTGAATTGCG 2100
TGGCGGAGTT GACCAGGTTT GCGGACAGAT ATTTCTACGG CGACTGGTGG AATTGCGTTT 2160
CGTTTGAAGA GTTTAGCAGA ATCTGGAACG TCCCCGTTCA CAAATTTTTA CTAAGACACG 2220
TGTACCACAG CTCCATGGGC GCATTGCATT TGAGCAAGAG CCAAGCTACA TTATTTACTT 2280
TTTTCTTGAG TGCCGTGTTC CACGAAATGG CCATGTTCGC CATTTTCAGA AGGGTTAGAG 2340
GATATCTGTT CATGTTCCAA CTGTCGCAGT TTGTGTGGAC TGCTTTGAGC AACACCAAGT 2400
TTCTACCGGC AAGACCGCAG TTGTCCAACG TTGTCTTTTC GTTTGGTGTC TGTTCAGGGC 2460
CCAGTATCAT TATGACGTTG TACCTGACCT TATGAACTGC CACCATACCA CGTGTGTCCC 2520
TCGCAAGCCC TTGATAGATA TACAATAGGG AATGGGCGTC CGTCCACCGT GGTCAAAGAC 2580
AGGGGCAAAG AGCTCCTAGG T
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 610 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Thr Glu Thr Lys Asp Leu Leu Gln Asp Glu Glu Phe Leu Lys Ile
1 5 10 15

Arg Arg Leu Asn Ser Ala Glu Ala Asn Lys Arg His Ser Val Thr Tyr
20 25 30

Asp Asn Val Ile Leu Pro Gln Glu Ser Met Glu Val Ser Pro Arg Ser
35 40 45

Ser Thr Thr Ser Leu Val Glu Pro Val Glu Ser Thr Glu Gly Val Glu
50 55 60

Ser Thr Glu Ala Glu Arg Val Ala Gly Lys Gln Glu Gln Glu Glu Glu
65 70 75 80

Tyr Pro Val Asp Ala His Met Gln Lys Tyr Leu Ser His Leu Lys Ser
85 90 95

Lys Ser Arg Ser Arg Phe His Arg Lys Asp Ala Ser Lys Tyr Val Ser
100 105 110

Phe Phe Gly Asp Val Ser Phe Asp Pro Arg Pro Thr Leu Leu Asp Ser
115 120 125

Ala Ile Asn Val Pro Phe Gln Thr Thr Phe Lys Gly Pro Val Leu Glu
130 135 140

Lys Gln Leu Lys Asn Leu Gln Leu Thr Lys Thr Lys Thr Lys Ala Thr
145 150 155 160

Val Lys Thr Thr Val Lys Thr Thr Glu Lys Thr Asp Lys Ala Asp Ala
165 170 175

Pro Pro Gly Glu Lys Leu Glu Ser Asn Phe Ser Gly Ile Tyr Val Phe
180 185 190

Ala Trp Met Phe Leu Gly Trp Ile Ala Ile Arg Cys Cys Thr Asp Tyr
195 200 205

Tyr Ala Ser Tyr Gly Ser Ala Trp Asn Lys Leu Glu Ile Val Gln Tyr
210 215 220

Met Thr Thr Asp Leu Phe Thr Ile Ala Met Leu Asp Leu Ala Met Phe
225 230 235 240

Leu Cys Thr Phe Phe Val Val Phe Val His Trp Leu Val Lys Lys Arg
245 250 255

Ile Ile Asn Trp Lys Trp Thr Gly Phe Val Ala Val Ser Ile Phe Glu
260 265 270

Leu Ala Phe Ile Pro Val Thr Phe Pro Ile Tyr Val Tyr Tyr Phe Asp
275 280 285

Phe Asn Trp Val Thr Arg Ile Phe Leu Phe Leu His Ser Val Val Phe
290 295 300

Val Met Lys Ser His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Asp
305 310 315 320

Ile Lys Gln Glu Leu Glu Tyr Ser Ser Lys Gln Leu Gln Lys Tyr Lys
325 330 335

Glu Ser Leu Ser Pro Glu Thr Arg Glu Ile Leu Gln Lys Ser Cys Asp
340 345 350

Phe Cys Leu Phe Glu Leu Asn Tyr Gln Thr Lys Asp Asn Asp Phe Pro
355 360 365

Asn Asn Ile Ser Cys Ser Asn Phe Phe Met Phe Cys Leu Phe Pro Val
370 375 380

Leu Val Tyr Gln Ile Asn Tyr Pro Arg Thr Ser Arg Ile Arg Trp Arg
385 390 395 400

Tyr Val Leu Glu Lys Val Cys Ala Ile Ile Gly Thr Ile Phe Leu Met
405 410 415

Met Val Thr Ala Gln Phe Phe Met His Pro Val Ala Met Arg Cys Ile
420 425 430

Gln Phe His Asn Thr Pro Thr Phe Gly Gly Trp Ile Pro Ala Thr Gln
435 440 445

Glu Trp Phe His Leu Leu Phe Asp Met Ile Pro Gly Phe Thr Val Leu
450 455 460

Tyr Met Leu Thr Phe Tyr Met Ile Trp Asp Ala Leu Leu Asn Cys Val
465 470 475 480

Ala Glu Leu Thr Arg Phe Ala Asp Arg Tyr Phe Tyr Gly Asp Trp Trp
485 490 495

Asn Cys Val Ser Phe Glu Glu Phe Ser Arg Ile Trp Asn Val Pro Val
500 505 510

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Gly Ala Leu
515 520 525

His Leu Ser Lys Ser Gln Ala Thr Leu Phe Thr Phe Phe Leu Ser Ala
530 535 540

Val Phe His Glu Met Ala Met Phe Ala Ile Phe Arg Arg Val Arg Gly
545 550 555 560

Tyr Leu Phe Met Phe Gln Leu Ser Gln Phe Val Trp Thr Ala Leu Ser
565 570 575

Asn Thr Lys Phe Leu Arg Ala Arg Pro Gln Leu Ser Asn Val Val Phe
580 585 590

Ser Phe Gly Val Cys Ser Gly Pro Ser Ile Ile Met Thr Leu Tyr Leu
595 600 605

Thr Leu
610

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2421 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TATAAAATTC CTTTCATCAA TACATCTATA TATTCGAATA TATAGATAAA CCAATACAAA 60

AACATACTGA AATTTTTTGA AAACAACTAA AACTATTCAT TGCAGTTACA CGTGAATGCT 120

AAACTTTATA TCGCTCTTGT CGGTCCCGCG GAGTTAACAT TTAACGGCTT CTCGCGCAAT 180

AACCGGAAAA ATTCCAACAG TTTCTTTGTA ATATTATTAA GCCTTCTTTT TTCCCGGAAT 240

CTATAAGAGG GGACGAAAAT TAGCCGCTAT TAATTCTGGT ATTGCCACCT AGACAAGAAG 300

TAAACAGACA CATTACGTTA GCAAAAGCAA CAATAACAAA CACAACCATG GACAAGAAGA 360

AGGATCTACT GGAGAACGAA CAATTTCTCC GCATCCAAAA GCTCAACGCT GCCGATGCGG 420

GCAAAAGACA ATCTATAACA GTGGACGACG AGGGCGAACT ATATGGGTTA GACACCTCCG 480

GCAACTCACC AGCCAATGAA CACACAGCTA CCACAATTAC ACAGAATCAC AGCGTGGTGG 540
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CCTCAAACGG AGACGTCGCA TTCATCCCAG 
TAACTGAAGA AGTGATTGAG ACCGATGATA
GCTCCAAAGA GAAGGCACGG TATAGGCAAG 
ATATGTCATT TGAACACAGG CCCAGTATAT
CCAAATTCGT GGGACCTACT TTAGAAAAGG 
CCATGCGCAA AAATTTACAC CACCGCAAGT
AAAATGATGG CGCCGCCCCA ACTACTGTTC 
CCGTTGAAAC CACCATAATT TCATCCAATT 
CTATTGCATT TGGTGCTGTC AAGGCTTTAA 
TCAAGGATTC GGAGATCTTG AAATTTATGA 
ATCTTTTGAT GTATTTGAGC ACTTATTTTG 
GGGTCTTGAA ATGGGGCACT ACCGGCTGGA 
TTATCTTCTA CATGTATTTA ACAGAAAACA 
TCCTTTTTTT GCATTCTTTA GTTTTATTGA 
GCTATCTATG GGGTATAAAG GAAGAACTAC 
AGGATTCTAT AAATGATCCA AAAGTTATTG 
GTTTTGAATT GAGCTCTCAG TCTTTAAGCG 
GTGCAAAAAG CTTTTTTTGG TTCACCATGT 
CAAGAACTAA GGAAATCAGA TGGAGCTACG 
CCATTTTCTT AATGATGATA GATGCTCAAA
TGGCTGTGCG CAATTCTGAA TGGACTGGTA 
TGCTCGTTGA TATCGTCCCA GGGTTTATCG 
GGGATGCCAT TTTGAACTGT GTGGCTGAAT 
GTGACTGGTG GAATTGTGTT AGTTGGGCAG 
ATAAGTTTTT GTTAAGACAT GTTTACCATA 
GTCAAGCAAC TTTGATGACC TTTTTCTTAA 
TTATCTTCAA GAAATTGAGG TTTTACTTGT 
TAGCTTTAAC AAATACTAAA TTCATGAGGA
GGCTCGGTAT CTGCATGGGA CCAAGTGTCA 
TCCTGCAACT GTTCTGTGGA GCTATTAAAT 
TTTTTTTTTT TTTTTTTTTA TTATTTACAA 
GTCATTTATT ACATTGGTTC A 
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(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 642 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Asp Lys Lys Lys Asp Leu Leu Glu Asn Glu Gln Phe Leu Arg Ile
1 5 10 15

Gln Lys Leu Asn Ala Ala Asp Ala Gly Lys Arg Gln Ser Ile Thr Val
20 25 30

Asp Asp Glu Gly Glu Leu Tyr Gly Leu Asp Thr Ser Gly Asn Ser Pro
35 40 45

Ala Asn Glu His Thr Ala Thr Thr Ile Thr Gln Asn His Ser Val Val
50 55 60

Ala Ser Asn Gly Asp Val Ala Phe Ile Pro Gly Thr Ala Thr Glu Gly
65 70 75 80

Asn Thr Glu Ile Val Thr Glu Glu Val Ile Glu Thr Asp Asp Asn Met
85 90 95

Phe Lys Thr His Val Lys Thr Leu Ser Ser Lys Glu Lys Ala Arg Tyr
100 105 110

Arg Gln Gly Ser Ser Asn Phe Ile Ser Tyr Phe Asp Asp Met Ser Phe
115 120 125

Glu His Arg Pro Ser Ile Leu Asp Gly Ser Val Asn Glu Pro Phe * ys
130 135 140

Thr Lys Phe Val Gly Pro Thr Leu Glu Lys Glu Ile Arg Arg Arg Glu
145 150 155 160

Lys Glu Leu Met Ala Met Arg Lys Asn Leu His His Arg Lys Ser Ser
165 170 175

Pro Asp Ala Val Asp Ser Val Gly Lys Asn Asp Gly Ala Ala Pro Thr
180 185 190

Thr Val Pro Thr Ala Ala Thr Ser Glu Thr Val Val Thr Val Glu Thr
195 200 205

Thr Ile Ile Ser Ser Asn Phe Ser Gly Leu Tyr Val Ala Phe Trp Met
210 215 220

Ala Ile Ala Phe Gly Ala Val Lys Ala Leu Ile Asp Tyr Tyr Tyr Gln
225 230 235 240

His Asn Gly Ser Phe Lys Asp Ser Glu Ile Leu Lys Phe Met Thr Thr
245 250 255

Asn Leu Phe Thr Val Ala Ser Val Asp Leu Leu Met Tyr Leu Ser Thr
260 265 270

Tyr Phe Val Val Gly Ile Gln Tyr Leu Cys Lys Trp Gly Val Leu Lys
275 280 285

Trp Gly Thr Thr Gly Trp Ile Phe Thr Ser Ile Tyr Glu Phe Leu Phe
290 295 300

Val Ile Phe Tyr Met Tyr Leu Thr Glu Asn Ile Leu Lys Leu His Trp
305 310 315 320

Leu Ser Lys Ile Phe Leu Phe Leu His Ser Leu Val Leu Leu Met Lys
325 330 335

Met His Ser Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Gly Ile Lys Glu
340 345 350

Glu Leu Gln Phe Ser Lys Ser Ala Leu Ala Lys Tyr Lys Asp Ser Ile
355 360 365

Asn Asp Pro Lys Val Ile Gly Ala Leu Glu Lys Ser Cys Glu Phe Cys
370 375 380

Ser Phe Glu Leu Ser Ser Gln Ser Leu Ser Asp Gln Thr Gln Lys Phe
385 390 395 400

Pro Asn Asn Ile Ser Ala Lys Ser Phe Phe Trp Phe Thr Met Phe Pro
405 410 415

Thr Leu Ile Tyr Gln Ile Glu Tyr Pro Arg Thr Lys Glu Ile Arg Trp
420 425 430

Ser Tyr Val Leu Glu Lys Ile Cys Ala Ile Phe Gly Thr Ile Phe Leu
435 440 445

Met Met Ile Asp Ala Gln Ile Leu Met Tyr Pro Val Ala Met Arg Ala
450 455 460

Leu Ala Val Arg Asn Ser Glu Trp Thr Gly Ile Leu Asp Arg Leu Leu
465 470 475 480

Lys Trp Val Gly Leu Leu Val Asp Ile Val Pro Gly Phe Ile Val Met
485 490 495

Tyr Ile Leu Asp Phe Tyr Leu Ile Trp Asp Ala Ile Leu Asn Cys Val
500 505 510

Ala Glu Leu Thr Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp
515 520 525

Asn Cys Val Ser Trp Ala Asp Phe Ser Arg Ile Trp Asn Ile Pro Val
530 535 540

His Lys Phe Leu Leu Arg His Val Tyr His Ser Ser Met Ser Ser Phe
545 550 555 560

Lys Leu Asn Lys Ser Gln Ala Thr Leu Met Thr Phe Phe Leu Ser Ser
565 570 575

Val Val His Glu Leu Ala Met Tyr Val Ile Phe Lys Lys Leu Arg Phe
580 585 590

Tyr Leu Phe Phe Phe Gln Met Leu Gln Met Pro Leu Val Ala Leu Thr
595 600 605

Asn Thr Lys Phe Met Arg Asn Arg Thr Ile Ile Gly Asn Val Ile Phe
610 615 620

Trp Leu Gly Ile Cys Met Gly Pro Ser Val Met Cys Thr Leu Tyr Leu
625 630 635 640

Thr Phe

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 983 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

ATGGAGCTCA ACTTTCCCCG CTCTCCCCGC ATCCGGAAGC GCTTTCTGCT GCGACGGATC 60 

CTTGAGATGC TGTTCTTCAC CCAGCTCCAG GTGGGGCTGA TCCAGCAGTG GATGGTCCCC 120

ACCATCCAGA ACTCCATGAA GCCCTTCAAG GACATGGACT ACTCACGCAT CATCGAGCGC 180

CTCCTGAAGC TGGCGGTCCC CAATCACCTC ATCTGGCTCA TCTTCTTCTA CTGGCTCTTC 240

CACTCCTGCC TGAATGCCGT GGCTGAGCTC ATGCAGTTTG GAGACCGGGA GTTCTACCGG 300

GACTGGTGGA ACTCCGAGTC TGTCACCTAC TTCTGGCAGA ACTGGAACAT CCCTGTGCAC 360

AAGTGGTGCA TCAGACACTT CTACAAGCCC ATGCTTCGAC GGGGCAGCAG CAAGTGGATG 420

GCCAGGACAG GGGTGTTCCT GGCCTCGGCT TTCTTCCACG AGTACCTGGT GAGCGTCCCT 480

CTGCGAATGT TCCGCCTCTG GGCTTTCACG GGCATGATGG CTCAGATCCC ACTGGCCTGG 540

TTCGTGGGCC GCTTTTTCCA GGGCAACTAT GGCAACGCAG CTGTGTGGCT GTCGCTCATC 600

ATCGGACAGC CAATAGCCGT CCTCATGTAC GTCCACGAAC TACTACGTGC TCAACTATGA 660

GGCCCCAGCG GCAGAGGCCT GAGCTGCACC TGAGGGCCTG GCTTCTCACT GCCACCTCAA 720

ACCCGCTGCC AGAGCCCACC TCTCCTCCTA GGCCTCGAGT GCTGGGGATG GGCCTGGCTG 780

CACAGCATCC TCCTCTGGTC CCAGGGAGGC CTCTCTGCCC TATGGGGCTC TGTCCTGCAC 840

CCCTCAGGGA TGGCGACAGC AGGCCAGACA CAGTCTGATG CCAGCTGGGA GTCTTGCTGA 900

CCCTGCCCCG GGTCCGAGGG TGTCAATAAA GTGCTGTCCA GTGAGAAAAA GAAAAAAAAA 960
AAAAAAAAAA ATTCTGCGGC CGC 
(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:



Met Glu Leu Asn Phe Pro Arg Ser Pro Arg Ile Arg Lys Arg Phe Leu
1 5 10 15

Leu Arg Arg Ile Leu Glu Met Leu Phe Phe Thr Gln Leu Gln Val Gly
20 25 30

Leu Ile Gln Gln Trp Met Val Pro Thr Ile Gln Asn Ser Met Lys Pro
35 40 45

Phe Lys Asp Met Asp Tyr Ser Arg Ile Ile Glu Arg Leu Leu Lys Leu
50 55 60

Ala Val Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Leu Phe
65 70 75 80

His Ser Cys Leu Asn Ala Val Ala Glu Leu Met Gln Phe Gly Asp Arg
85 90 95

Glu Phe Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp
100 105 110

Gln Asn Trp Asn Ile Pro Val His Lys Trp Cys Ile Arg His Phe Tyr
115 120 125

Lys Pro Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Thr Gly
130 135 140

Val Phe Leu Ala Ser Ala Phe Phe His Glu Tyr Leu Val Ser Val Pro
145 150 155 160

Leu Arg Met Phe Arg Leu Trp Ala Phe Thr Gly Met Met Ala Gln Ile
165 170 175

Pro Leu Ala Trp Phe Val Gly Arg Phe Phe Gln Gly Asn Tyr Gly Asn
180 185 190

Ala Ala Val Trp Leu Ser Leu Ile Ile Gly Gln Pro Ile Ala Val Leu
195 200 205

Met Tyr Val His Glu Leu Leu Arg Ala Gln Leu
210 215

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATGTTGAACT TCATGATGCA TGACCAGCGC ACCGGCCCGG CATGGAACGT GCTGATGTGG 60 

ACCATGCTGT TTCTAGGCCA GGGAATCCAG GTCAGCCTGT ACTGCCAGGA GTGGTACGCA 120 

CGGACGCACT GCCCCTTACC CCAGGCAACT TTCTGGGGGC TGGTGACACC TCGATCTTGG 180 

TCCTGCCATA CCTAGAGGTC GGGACAGACG ACGCTACCTG CCCAGACACC ACCAAGTTCT 240

CTGCCTGCM AACCTGGGGA CCAGGACTTC CTGTCTTGCA TTCCCAAATT TGGGTTCTTG 300 

AGTCGAGGCA ACCTTGCACA CAAGACCCCA CCAAGGGATT GTTGCAAGGG ATTAGATTTT 360

GCAGATTTGT TGGGTAATGA TTCAACGACT CAGCTGGGGG TTGAC TJAGGG TTGATTTTTC 420

AATCCTTTTC CCCTGGGTTT GGGTTACAGG TTTTT 455 
(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 64 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Leu Asn Phe Met Met His Asp Gln Arg Thr Gly Pro Ala Trp Asn
1 5 10 15

Val Leu Met Trp Thr Met Leu Phe Leu Gly Gln Gly Ile Gln Val Ser
20 25 30

Leu Tyr Cys Gln Glu Trp Tyr Ala Arg Thr His Cys Pro Leu Pro Gln
35 40 45

Ala Thr Phe Trp Gly Leu Val Thr Pro Arg Ser Trp Ser Cys His Thr
50 55 60



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 517 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

ATGGACAACG CGGGGTCTGA TACGACTCAC TATAGGGAAT TTGGCCCTCG AGCAGTAGAT 60

TCGGCACGAT GGGCACGAGG ACTCCATCAT GTTCCTCAAG CTTTATTCCT ACCGGGATGT 120

CAACCTGTGG TGCCGCCAGC GAAGGGTCAA GGCCAAAGCT GTCTCTACAG GGAAGAAGGT 180

CAGTGGGGCT GCTGCGAGCA AGCTGTGAGC TATCCAGACA ACCTGACCTA CCGAGATCTC 240

GATTACTTCA TCTTTGCTCC TACTTTGTGT TATGAACTCA ACTTTCCTCG GTCCCCCCGA 300

ATACGAGAGC GCTTTCTGCT ACGACGAGTT CTTGAGATGC TCTTTTTTAC CCAGCTTCAA 360

GTGGGGCTGA TCCAACAGTG GATGGTCCCT ACTATCCAGA ACTCCATGGA AGCCCTTTCA 420

AGAGCTTCTG GCAGTTTTGG AGACCGCGAG TTCTACAGAG ATTGGTGGAA TGCTGAGTCT 480

GTCACCGACT TTTGGCAGAA CTGGAATATC CCCGTGG 517
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 172 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Asp Asn Ala Gly Ser Asp Thr Thr His Tyr Arg Glu Phe Gly Pro
1 5 10 15

Arg Ala Val Asp Ser Ala Arg Trp Ala Arg Gly Leu His His Val Pro
20 25 30

Gln Ala Leu Phe Leu Pro Gly Cys Gln Pro Val Val Pro Pro Ala Lys
35 40 45

Gly Gln Gly Gln Ser Cys Leu Tyr Arg Glu Glu Gly Gln Trp Gly Cys
50 55 60

Cys Glu Gln Ala Val Ser Tyr Pro Asp Asn Leu Thr Tyr Arg Asp Leu
65 70 75 80

Asp Tyr Phe Ile Phe Ala Pro Thr Leu Cys Tyr Glu Leu Asn Phe Pro
85 90 95

Arg Ser Pro Arg Ile Arg Glu Arg Phe Leu Leu Arg Arg Val Leu Glu
100 105 110

Met Leu Phe Phe Thr Gln Leu Gln Val Gly Leu Ile Gln Gln Trp Met
115 120 125

Val Pro Thr Ile Gln Asn Ser Met Glu Ala Leu Ser Arg Ala Ser Gly
130 135 140

Ser Phe Gly Asp Arg Glu Phe Tyr Arg Asp Trp Trp Asn Ala Glu Ser
145 150 155 160

Val Thr Asp Phe Trp Gln Asn Trp Asn Ile Pro Val
165 170

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 366 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Lys Asp Leu Leu Glu Phe Leu Lys Ile Arg Leu Asn Ala Asp Ala
1 5 10 15

Lys Arg Ser Thr Asp Ser Pro Thr Val Ser Glu Val Glu Arg Gly Lys
20 25 30

Gln Glu Ile Glu Ala His Lys Ser Lys Lys Arg Phe Arg Ser Phe Ser
35 40 45

Phe Phe Asp Ser Phe Glu Arg Pro Ser Leu Leu Asp Gly Asn Pro Phe
50 55 60

Thr Thr Phe Gly Pro Val Leu Glu Lys Glu Lys Asn Leu His Lys Lys
65 70 75 80

Lys Thr Thr Val Thr Asp Val Ser Asn Phe Ser Gly Ile Tyr Val Phe
85 90 95

Trp Met Leu Ala Leu Asp Tyr Tyr Gly Glu Ile Leu Tyr Met Thr Thr
100 105 110

Leu Phe Thr Val Ala Asp Leu Met Phe Leu Ser Thr Phe Phe Val Val
115 120 125

Leu Lys Trp Thr Gly Ile Ser Ile Glu Phe Leu Phe Ile Phe Leu Trp
130 135 140

Ser Arg Ile Phe Leu Phe Leu His Ser Val Phe Val Met Lys His Ser
145 150 155 160

Phe Ala Phe Tyr Asn Gly Tyr Leu Trp Ile Lys Glu Glu Leu Ser Leu
165 170 175

Lys Tyr Lys Glu Ser Ser Pro Leu Gln Lys Ser Cys Phe Cys Phe Glu
180 185 190

Leu Gln Phe Pro Asn Asn Ile Ser Phe Phe Phe Phe Pro Thr Leu Ile
195 200 205

Tyr Gln Ile Tyr Pro Arg Thr Ile Arg Trp Tyr Val Leu Glu Lys Cys
210 215 220

Ala Ile Phe Gly Thr Ile Phe Leu Met Met Ala Gln Met Pro Val Ala
225 230 235 240

Met Arg Asn Phe Trp Gln Leu Leu Asp Ile Pro Gly Phe Val Leu Tyr
245 250 255

Leu Thr Phe Tyr Ile Trp Asp Ala Leu Asn Cys Val Ala Glu Leu Thr
260 265 270

Arg Phe Gly Asp Arg Tyr Phe Tyr Gly Asp Trp Trp Asn Cys Val Ser
275 280 285

Phe Ser Arg Ile Trp Asn Val Pro Val His Lys Phe Leu Leu Arg His
290 295 300

Val Tyr His Ser Ser Met Phe Lys Leu Lys Ser Gln Ala Thr Leu Thr
305 310 315 320

Phe Phe Leu Ser Ala Val Val His Glu Ala Met Val Ile Phe Arg Tyr
325 330 335

Leu Phe Phe Gln Gln Met Ala Leu Asn Thr Lys Phe Arg Arg Ile Asn
340 345 350

Val Phe Trp Gly Cys Gly Pro Ser Val Thr Leu Tyr Leu Thr
355 360 365

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Amino Acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Pro Asn His Leu Ile Trp Leu Ile Phe Phe Tyr Trp Leu Phe His Ser
1 5 10 15

Cys Leu Asn Ala Val Ala Glu Leu Met Gln Phe Gly Asp Arg Glu Phe
20 25 30

Tyr Arg Asp Trp Trp Asn Ser Glu Ser Val Thr Tyr Phe Trp Gln Asn
35 40 45

Trp Lys Ile Pro Val His Lys Trp Cys Ile Arg His Phe Tyr Lys Pro
50 55 60

Met Leu Arg Arg Gly Ser Ser Lys Trp Met Ala Arg Asp Arg Gly Val
65 70 75 80

Pro Gly Pro Ser Ala Phe Phe His Val Val Thr Trp Val Ser Val Pro
85 90 95

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
GAGGGGACGA AAATTAGCCG CTATTAATTC TGGTATTGCC ACCTAGACAA GAAGTAAACA 
GACACAGATG CAAGAGTTCG AATCTCTTAG C 
(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CTATAAAGAT TTAATAGCTC CACAGAACAG TTGCAGGATG CCTTAGGGTC GACTACGTCG
TAAGGCCGTT TCTGAC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CATTGCAGTT ACACGTGAAT GC 
(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TAGCTCCACA GAACAGTTGC AGG 
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CTCTGACAAC AACGAAGTCA G 
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What is claimed is: 



1. An isolated nucleic acid which encodes an
acylcoenzyme A: cholesterol acyltransferase II.

2. An isolated nucleic acid which encodes an
acylcoenzyme A: cholesterol acyltransferase III.

3. The isolated nucleic acid of claim 1 or 2, wherein
the nucleic acid is DNA or RNA.

4. The isolated nucleic acid of claim 3, wherein the 
nucleic acid is cDNA or genomic DNA. 

5. The isolated nucleic acid of claim 1 comprising a
nucleic acid having the sequence as set forth in
Figure 15.

6. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a human wildtype acylcoenzyme
A: cholesterol acyltransferase II having
substantially the same amino acid sequence as set
forth in Figure 15.

7. The isolated nucleic acid of claim 2, comprising a
nucleic acid having the sequence as set forth in
Figure 16.

8. The isolated nucleic acid of claim 2, wherein the
nucleic acid encodes a human wildtype acylcoenzyme
A: cholesterol acyltransferase III having
substantially the same amino acid sequence as set
forth in Figure 16.

9. The isolated nucleic acid of claim 1 comprising a
nucleic acid having the sequence designated Seq.
I.D. No.: 11.

10. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a mouse wildtype acylcoenzyme
A: cholesterol acyltransferase II having
substantially the same amino acid sequence as the
sequence designated Seq. I.D. No.: 12.

11. The isolated nucleic acid of claim 1, wherein the
nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II.

12. The isolated nucleic acid of claim 2, wherein the
nucleic acid encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III.

13. A vector comprising the isolated nucleic acid of
claim 1 or 2.

14. The vector of claim 13 further comprising a promoter 
of RNA transcription operatively linked to the 
nucleic acid. 

15. The vector of claim 14, wherein the promoter
comprises a bacterial, yeast, insect or mammalian
promoter.

16. The vector of claim 14, further comprising plasmid,
cosmid, yeast artificial chromosome (YAC),
bacteriophage or eukaryotic viral DNA.



17. The vector of claim 14 designated YEpAB-ACAT2.

18. A host vector system for the production of a




polypeptide which comprises the vector of claim 14 
in a suitable host. 

19. The host vector system of claim 18, wherein the
suitable host is a prokaryotic or eukaryotic cell.

20. The host vector system of claim 19, wherein the 
prokaryotic cell is a bacterial cell. 

21. The host vector system of claim 19, wherein the
eukaryotic cell is a yeast, insect, plant or
mammalian cell.

22. A method for producing a polypeptide which comprises
growing the host vector system of claim 18 under
suitable conditions permitting production of the
polypeptide and recovering the polypeptide so
produced.

23. A method of obtaining a polypeptide in purified form
which comprises:

(a) introducing the vector of claim 14 into a
suitable host cell;

(b) culturing the resulting cell so as to produce
the polypeptide;

(c) recovering the polypeptide produced in step
(b); and

(d) purifying the polypeptide so recovered.

24. A purified wildtype acylcoenzyme A: cholesterol
acyltransferase II.

25. A purified mutant acylcoenzyme A: cholesterol
acyltransferase II.

26. A purified wildtype acylcoenzyme A: cholesterol
acyltransferase III.

27. A purified mutant acylcoenzyme A: cholesterol
acyltransferase III.

28. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within a nucleic
acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase II without hybridizing
to a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase II.

29. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II without hybridizing
to a nucleic acid which encodes a wildtype
acylcoenzyme A: cholesterol acyltransferase II.

30. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within a nucleic
acid which encodes a wildtype acylcoenzyme A:
cholesterol acyltransferase III without hybridizing
to a nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase III.

31. An oligonucleotide of at least 15 nucleotides
capable of specifically hybridizing with a unique
sequence of nucleotides present within the nucleic
acid which encodes a mutant acylcoenzyme A:
cholesterol acyltransferase III without hybridizing
to a nucleic acid which encodes a wildtype
acylcoenzyme A: cholesterol acyltransferase III.

32. The oligonucleotide of claim 28, 29, 30 or 31
wherein the nucleic acid is DNA or RNA.

33. A nucleic acid having a sequence complementary to
the sequence of the isolated nucleic acid of claim
1 or 2.

34. A method for determining whether a subject known to
have an imbalance in sterol levels has the imbalance
due to a defect in esterification of sterol which
comprises:

(a) obtaining from the subject an appropriate
sample containing a mixture of all of the
subject's nucleic acids; and

(b) determining whether any nucleic acid in the
sample from step (a) is, or is derived from, a
nucleic acid which encodes a mutant
acylcoenzyme A: cholesterol acyltransferase so
as to thereby determine whether the subject's
imbalance in sterol levels is due to a defect
in esterification of sterol.

35. The method of claim 34, wherein the determining of
step (b) comprises:

(i) contacting the sample of step (a) with the
isolated nucleic acid of claim 11 or 12 or
the oligonucleotide of claim 29 or 31
under conditions permitting binding of any
nucleic acid in the sample which is, or is
derived from, a nucleic acid which encodes
a mutant acylcoenzyme A: cholesterol
acyltransferase to the nucleic acid or
oligonucleotide so as to form a complex;

(ii) isolating the complex so formed; and

(iii) identifying the nucleic acid in the
isolated complex so as to thereby
determine whether any nucleic acid in the
sample contains a nucleic acid which is,
or is derived from, a nucleic acid which
encodes a mutant acylcoenzyme A:
cholesterol acyltransferase II or III.

36. The method of claim 35, wherein the isolated nucleic
acid or the oligonucleotide is labeled with a
detectable marker.

37. The method of claim 36, wherein the detectable
marker is a radioactive isotope, a fluorophore or an
enzyme.



38. The method of claim 35, wherein the nucleic acid
sample is first bound to a solid matrix before
performing step (i).

39. The method of claim 35, wherein the sample comprises
blood or sera.

40. A method for treating a subject who has an imbalance
in sterol levels due to a defect in esterification
of sterol which comprises introducing the isolated
nucleic acid of claim 1 or 2 into the subject under
conditions such that the nucleic acid expresses a
wildtype acylcoenzyme A: cholesterol acyltransferase
II or III, so as to thereby treat the subject.

41. A method for inhibiting wildtype acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
which comprises transforming appropriate cells from
the subject with a vector which expresses the
nucleic acid of claim 33, and introducing the
transformed cells into the subject so as to thereby
inhibit wildtype acylcoenzyme A: cholesterol
acyltransferase II or III.

42. The method of claim 41, wherein the nucleic acid of
claim 33 is capable of specifically hybridizing to
an mRNA molecule encoding acylcoenzyme A: cholesterol
acyltransferase II or III so as to prevent
translation of the mRNA molecule.

43. A method for inhibiting the wildtype acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
which comprises introducing the oligonucleotide of
claim 28 or 30 into the subject so as to thereby
inhibit the wildtype acylcoenzyme A: cholesterol
acyltransferase II or III.

44. The method of claim 43, wherein the oligonucleotide
of claim 28 or 30 is capable of specifically
hybridizing to an mRNA molecule encoding acylcoenzyme
A: cholesterol acyltransferase II or III so as to
prevent translation of the mRNA molecule.

45. A method for identifying a chemical compound which
is capable of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject which
comprises:

(a) contacting a wildtype acylcoenzyme A:
cholesterol acyltransferase II or III with the
chemical compound under conditions permitting
binding between the acylcoenzyme and the
chemical compound;

(b) detecting specific binding of the chemical
compound to the acylcoenzyme; and

(c) determining whether the chemical compound
inhibits the activity of the coenzyme so as to
identify a chemical compound which is capable
of inhibiting acylcoenzyme A: cholesterol
acyltransferase II or III in a subject.

46. A pharmaceutical composition comprising the chemical
compound identified by the method of claim 45 in an
amount effective to inhibit acylcoenzyme A:
cholesterol acyltransferase II or III in a subject
and a pharmaceutically effective carrier.

47. A method of treating a subject who has
atherosclerosis comprising administering the
pharmaceutical composition of claim 46 to the
subject.

48. A method of treating a subject who has
hyperlipidemia comprising administering the
pharmaceutical composition of claim 46 to the
subject.

49. A transgenic, nonhuman mammal comprising the
isolated nucleic acid of claim 1 or 2.
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FIG. 2A 
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FIG. 2B 
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FIG. 2C 
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